EXPERIMENTAL  TESTS  OF  INOEPENCENCE  ASSUMPTIONS  FOR  RISKY  MULTI A— ETC (U) 
OCT  76  D VON  WINTERFELDT  N00014-76-C-0074 


fCr- 


0018134-T 


social  science 

research  institute 


TECHNICAL  REPORT 

Experimental  Tests  of  Independence  Assumptions 
for  Risky  Multiattribute  Preferences 

Detlof  von  Winterfeldt 


Sponsored  by: 

Advanced  Research  Projects  Agency 
Department  of  Defense 


Monitored  by: 

Engineering  Psychology  Programs 
Office  of  Naval  Research 
Contract  No.  N00014-76-C-0074,  ARPA 


Approved  for  Public  Release  ; 
Distribution  Unlimited; 
Reproduction  in  Whole  or  in  Part  is  Permitted 
for  Any  Usb  op  the  U.S.  Government 


U aw*  : 

ifctseji 


October  1976 


SSRI  Research  Report  76-8 


Technical 


EXPERIMENTAL  TESTS  OF  INDEPENDENCE  ASSUMPTIONS 


FOR  RISKY  MULTIATTRIBUTE  PREFERENCES, 


Detlofy  von  Winterfeldt 


Defense  Advanced  Research  Projects  Agency 
Under  Subcontract  from 
I Decisions  and  Designs,  Incorporated 


SOCIAL  SCIENCE  RESEARCH  INSTITUTE 
University  of  Southern  California 
Los  Angeles,  California  90007 


SUMMARY 

The  purpose  of  this  experiment  was  to  analyze  models 
of  human  preferences  in  complex  decision  situations  that 
are  characterized  by  uncertainty  and  multiple  attributes 
of  outcomes.  Four  basic  models  for  such  risky  multiattribute 
v preferences  were  considered,  among  them  the  additive  and 

multiplicative  expected  utility  models.  Independence 

* 

assumptions  that  cai>  test  the  descriptive  validity  of 
these  models  were  formulated. 

The  validity  of  the  independence  assumptions,  includ- 
ing the  marginality  assumption  and  utility  independence, 
was  tested  for  subjects'  preferences  among  even  chance 
gambles  for  commodity  bundles  containing  gasoline  and 
ground  beef.  Subjects  matched  gatnbles  or  commodity  bundles 
against  a standard  and  these  matches  were  checked  to  see 
if  the  indifference  held  in  various  stimulus  contexts  as 
required  by  the  independence  assumptions.  Effects  of 
response  modes,  instructions,  and  personal  preference 
characteristics  were  examined. 

% 


All  independence  assumptions  and  models  were  violated 
by  a bias  to  prefer  a gamble  or  commodity  bundle  that  was 


previously  matched  against  a standard,  independently  of 
context.  Systematic  and  strong  violations  of  the  marginal- 
ity  assumption  were  found  in  form  of  a multivariate  risk 
aversion:  subjects  tended  to  prefer  a gamble  with  more 

balanced  multiple  outcomes  over  a gamble  with  extreme 
multiple  outcomes,  even  if  all  single  outcomes  had  an 
equal  chance  of  occurring.  Both  the  bias  and  multivariate 
risk  aversion  were  independent  of  response  modes  and 
instructions.  Other  preference  characteristics  such  as 
single  attribute  risk  attitude  and  preferential  inter- 
action of  commodities  seemed  unrelated  to  multivariate 
risk  aversion. 

The  bias  to  prefer  a previously  matched  gamble  over  a 
standard  cannot  be  explained  by  any  traditional  model 
describing  risky  multiattribute  preferences.  This  bias 
could  be  due  either  to  mismatching  or  to  a change  in  pre- 
ferences after  matching.  The  phenomenon  of  multivariate 
risk  aversion  proved  to  be  a stable  property  of  risky 
multiattribute  preferences  for  the  stimuli  considered. 
Descriptive  models  for  risky  multiattribute  preferences 
will  have  to  take  this  phenomenon  into  account  in  similar 
stimulus  situations.  For  normative  modelling  the  results 
of  the  experiment  indicate  the  necessity  to  carefully 
check  the  consistency  of  preferences  assessed  by  procedures 
that  are  based  on  indifference  judgments  and  to  compare 


. 


them  with  actual  choices.  The  multivariate  risk  aversion 
effect  suggests  that  simple  additive  expected  utility 


models  may,  in  some  cases,  be  inappropriate  for  prescribing 
preferences.  Checks  of  the  marginality  assumption  and 
analyses  of  the  form  of  multivariate  risk  aversion  should 
be  designed  and  tested  carefully,  before  modelling  decision 
makers'  preferences  with  additive  expected  utility  models. 
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INTRODUCTION 

Some  of  the  most  interesting  and  common  difficulties  in 
human  decision  making  arise  when  decisions  are  risky  and 
their  outcomes  have  several  value  aspects  or  attributes.  A 
decision  is  said  to  be  risky  if  the  decision  maker  cannot 
predict  with  certainty  which  outcome  it  will  produce.  Gam- 
bling and  buying  insurance  policies  are  the  most  typical 
examples  for  decision  making  under  risk.  Should  one  buy 
collision  insurance  for  an  old  and  rusty  VW?  Or  should  one 
bet  on  a good  looking  long  shot  in  a horse  race?  Such  deci- 
sion situations  confront  the  decision  maker  with  the  problem 
of  trading  off  probabilities  of  outcomes  against  their  bene- 
fits and  losses.  These  trade-offs  are  inherently  subjective, 
depending  on  the  decision  maker's  present  assets,  his  risk 
attitudes,  his  likelihood  estimates  and  his  subjective 
evaluations  of  costs  and  benefits.  There  are  no  clear  cut 
rules  or  formulas,  and  risky  decision  making  problems  can 
rarely  be  solved  without  much  difficult  deliberation. 

A decision  problem  is  called  multiattributed  if  deci- 
sion outcomes  vary  in  more  than  one  value  relevant  dimension 
or  attribute.  Examples  are  buying  a new  car,  renting  an 
apartment,  allocating  funds  to  social  programs,  or  industrial 
facility  siting.  Is  an  ocean  view  really  worth  an  additional 
$50  in  rent  for  an  apartment?  Or  is  a 10  mile  increase  in 
the  distance  of  an  industrial  site  from  a densely  populated 
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area  worth  a $100,000  increase  in  the  costs  of  building  ac- 
cess roads?  Solving  such  multiattribute  problems  requires 
trade-offs  among  benefits  and  losses  in  one  attribute 
against  benefits  and  losses  in  another.  As  in  risky  decision 
making  such  trade-offs  are  subjective  and  no  clear  cut  rules 
exist  for  making  them.  How  a decision  maker  trades  off  out- 
come attributes  against  others  depends  on  his  subjective 
judgments  of  the  relative  importance  of  attributes  and  his 
evaluations  of  outcomes  within  each  attribute. 

Since  risky  and  multiattribute  decision  problems  are 
common,  problematic,  and  inherently  subjective,  they  have 
been  the  most  frequently  studied  paradigms  in  behavioral 
decision  theory.  Other  problems  such  as  time  or  group  pref- 
erences have  received  comparatively  little  attention.  In  the 
late  50' s and  the  early  60' s most  theoretical  and  experi- 
mental research  concentrated  on  the  single  attribute  risky 
paradigm,  i.e.,  on  decision  problems  that  are  characterized 
by  single-dimensional  outcomes  that  are  uncertain.  Since 
the  end  of  the  60 's,  decision  theoretic  research  has  shifted 
to  problems  of  multiattribute  decision  making.  But  while 
most  of  the  theoretical  research  dealt  with  the  joint  risky 
and  multiattribute  decision  problem  in  which  outcomes  are 
both  risky  and  multiattributed,  experimental  work  was  almost 
exclusively  restricted  to  the  riskless  multiattribute  para- 
digm in  which  outcomes  are  multiattributed  but  certain.  The 
experiment  described  in  this  thesis  tried  to  fill  this  gap 
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in  the  experimental  literature.  Its  purpose  was  to  provide 
a deeper  understanding  of  the  behavioral  phenomena  that 

t 

characterize  preferences  in  joint  risky  and  multiattribute 
decision  problems. 

Before  discussing  the  theoretical  and  experimental 
literature  of  this  joint  problem  of  risky  multiattribute 
preferences,  it  is  useful  to  summarize  briefly  the  main 
results  on  the  separate  topics  of  risky  single  attribute 
preferences  and  riskless  multiattribute  preferences.  Most 
of  the  experimental  literature  on  human  decision  making 
under  risk  examined  subjects'  preferences  among  gambles  for 
money  as  a prototypical  decision  problem.  The  most  prominent 
models  to  describe  such  risky  preferences  are  expectation 
models  which  assume  that  subjects  order  gambles  according  to 
some  expected  value,  where  subjective  transformations  may  be 
allowed  on  probabilities  and  on  monetary  values.  Expecta- 
tion models  vary  in  the  degree  to  which  they  allow  such 
transformations,  ranging  from  the  strict  expected  value 
model  (EV)  that  does  not  allow  either  transformation  to  the 
subjective  expected  utility  model  (SEU)  that  allows  both, 
v.  Neumann  and  Morgenstern  (1947)  laid  the  foundation  of 
expectation  models,  that  was  later  modified  by  Savage  (1954) 
and  Luce  and  Krantz  (1971) . Alternative  models  to  describe 
human  preferences  under  risk  are  portfolio  theory  (Coombs, 
1972)  and  minimax  or  minimax  regret  models  (see  Wendt,  1970) . 
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In  portfolio  theory,  subjects  are  assumed  to  trade-off  the 
expected  monetary  value  of  a gamble  against  some  perceived 
quantity,  called  the  "risk"  of  the  gamble.  Minimax  and  mini- 
max regret  models  completely  ignore  probabilities  and  require 
that  decisions  are  made  on  the  basis  of  outcomes  alone. 

Psychological  research  has  almost  exclusively  studied 
expectation  models.  Most  of  the  experimental  results  are 
summarized  in  Edwards  (1954,  1961),  Becker  and  MacClintock 
(1967) , and  Lee  (1971) . The  main  conclusions  of  these 
experiments  were: 

1)  When  compared  with  global  preferences  among  gambles 
for  monetary  outcomes,  expectation  models  describe 
subjects'  preferences  well. 

2)  Experimental  situations  can  be  created  in  which 
subjects  consistently  violate  most  of  the  assump- 
tions and  implications  of  expectation  models, 
including  many  invariance  assumptions  (sure  thing, 
irrelevance  of  context  gambles,  etc.)  and  even 
transitivity. 

This  contradiction  between  global  validity  and  specific 
violations  has  not  been  resolved  up  to  now.  Proponents  of 
alternative  theories  (Coombs,  1972)  consider  expectation 
models  invalid  in  principle.  Other  researchers  stress  the 
global  validity  of  expectation  models  which  they  consider 
hardly  shaken  by  the  specific  evidence  against  them.  No 
matter  what  position  one  takes,  however,  the  fact  remains 
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that  expectation  models  have  at  present  no  serious  and 
well  tested  contender  among  descriptive  models  for  human 
preferences . 

Riskless  multiattribute  preference  models  are  mainly 
decomposition  models  that  express  how  the  values  of  single 
attribute  evaluations  are  aggregated  to  a total  value  of 
a multiattribute  outcome.  The  single  most  prominent 
model  of  this  class  is  the  weighted  additive  model  in 
which  numerical  single  attribute  evaluations  are  weighted 
by  an  attribute  importance  weight  and  added  across  attri- 
butes. The  model  assumes  that  the  order  of  these  weighted 
sums  describes  the  actual  preference  order  of  the  decision 
maker.  Theoretical  foundations  of  such  decomposition 
models  can  be  found  in  conjoint  measurement  theory 
(Krantz,  Luce,  Suppes,  and  Tversky,  1971)  and  in  decom- 
position forms  of  bisymmetric  measurement  models 
(Fishburn,  1975) . 

In  the  experimental  research  typically  a simple 
weighted  additive  model  was  constructed,  value  indices 
were  generated  for  a set  of  outcomes,  and  the  correlation 
between  these  value  indices  and  subjects'  global  numeri- 
cal evaluations  or  preference  orders  was  computed  to 
indicate  the  degree  to  which  the  additive  model  described 
subjects'  preferences.  Several  such  validation  experi- 
ments were  performed  in  the  early  70' s,  all  of  which  are 
best  summarized  in  Fischer  (1975).  Correlations  between 
the  value  indices  generated  by  the  additive  model  and  the 
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global  evaluations  or  preference  rank  orderings  were 
usually  in  the  . 80's,  often  in  the  .90's,  depending  on  the 
number  of  attributes.  This  result  has  typically  been 
interpreted  as  a high  descriptive  validity  of  additive 
models  in  complex  evaluation  or  preference  problems. 

Another  interpretation  of  these  results  is  that  addi- 
tive models  can  predict  people's  preferences  not  because 
they  are  additive,  but  rather  because  additive  models  can 
approximate  non-additive  preferences  very  well.  This 
argument  is  best  developed  in  Dawes  and  Corrigan  (1974)  , 
who  conclude  that  under  two  conditions  additive  models 
will  be  good  predictors  of  non-additive  preferences:  if 
preference  judgments  are  measured  with  error,  and  if  pref- 
erences are  conditionally  monotone  in  each  attribute  (that 
is,  if  more  in  each  attribute  is  preferred  to  less, 
independently  of  other  constant  attribute  values) . 

These  experimental  results  on  risky  single  attribute 
preferences  and  riskless  multiattribute  preferences  are 
not  sufficient  to  understand  the  nature  of  preferences  when 
outcomes  are  both  uncertain  and  multiattributed.  There  are 
many  interesting  problems  that  arise  specifically  from  the 
interaction  between  risk  and  multiple  attributes  that  have 
not  yet  been  studied  experimentally,  although  the  rich 
theoretical  literature  on  risky  multiattribute  preference 
models  offers  several  hypotheses  for  experimental  investi- 
gation. The  experiment  described  in  this  thesis  was 
designed  to  test  some  of  the  main  assumptions  underlying 
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such  models,  and  to  analyze  the  behavioral  consequences 
of  the  interaction  between  risk  and  multiple  outcomes  in 
decision  making. 

In  the  following  section  the  joint  risky  multi- 
attribute paradigm  will  be  discussed  in  detail,  various 
model  forms  for  risky  multiattribute  preferences  and  their 
behavioral  assumptions  will  be  described,  and  the  few 
experimental  results  that  bear  on  an  understanding  of  the 
risky  multiattribute  problem  will  be  summarized.  This 
discussion  will  then  lead  to  the  formulations  of  the 
. hypotheses  and  experimental  questions  of  this  thesis, 

followed  by  the  experimental  methods  and  results.  The 
implications  of  the  theoretical  discussions  and  the  exper- 
imental results  for  an  understanding  of  human  decision 
making  under  risk  and  with  multiple  attributes  of  outcomes 
will  be  discussed.  Possible  problems  in  real  world 
applications  of  risky  multiattribute  preference  models 
will  be  considered  in  the  light  of  the  experimental  data. 


JOINT  RISKY  AND  MULTIATTRIBUTE  PREFERENCES 


This  section  will  formally  present  the  risky  multi- 
attribute decision  making  paradigm,  explain  the  main  model 
classes  that  describe  or  prescribe  human  preferences  in  this 
paradigm,  and  summarize  the  results  of  four  experiments  that 
tested  some  of  the  models.  The  terminology  used  in  this 
section  was  adapted  from  Krantz , Luce,  Suppes  and  Tverky's 
"Foundations  of  Measurement"  (1971).  Other  important  sources 
that  may  help  to  understand  the  model  expositions  are 
Fishburn  and  Keeney  (1974),  and  Keeney  and  Raiffa  (in  press). 


Paradigm 

To  recap:  a risky  and  multiattribute  decision  problem 
is  a problem  in  which  a decision  maker  has  to  select  among 
alternative  courses  of  action  that  produce  uncertain  multi- 
attributed  outcomes.  Many  important  individual  or 
organizational  decision  problems  are  of  this  type.  An 
individual  decision  making  example  is  the  problem  of  job 
selection,  where  jobs  may  have  many  value  relevant  aspects 
such  as  salary,  location,  staff  benefits,  professional 
climate.  Some  of  these  aspects  may  be  uncertain,  such  as 


the  chances  for  promotion  or  salary  increases.  An  organi 


zational  example  is  the  siting  of  industrial  plants.  Sites 
vary  on  many  strongly  weighed  cost  and  benefit  attributes 
such  as  costs  for  building  transmission  lines,  costs  of 
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land  acquisition,  environmental  costs,  geological  qualities, 
etc.  The  risks  involved  in  siting  decisions  (e.g.,  risk  of 
earthquakes)  and  the  uncertainty  of  being  able  to  measure 
all  outcomes  precisely  (e.g.,  the  exact  costs)  make  this 
multiattribute  problem  also  risky. 

A paradigmatic  way  to  structure  the  problem  of  selecting 
among  risky  and  multiattributed  alternatives  uses  vectors  and 
matrices  as  descriptions  of  risky  multiattribute  alternatives. 
Consider,  for  example,  that  the  choice  alternatives  are 
gambles  for  commodity  bundles  containing  certain  amounts  of 
gasoline  and  ground  beef.  A specific  gamble  may  pay  off  10 
gallons  of  gasoline  and  8 pounds  of  ground  beef,  if  heads 
come  up  on  the  flip  of  a coin,  or  nothing  at  all,  if  tails 
come  up.  This  gamble  can  be  described  by  a 2 x 2 matrix 
in  which  the  rows  are  single  attribute  gambles,  and  the 
columns  are  single  event  outcomes: 


A EVENTS 

T 

T HEADS  TAILS 


R 

I 

GAS 

10 

0 

B 

U 

BEEF 

8 

0 

T 

E 


If  all  outcomes  are  contingent  on  the  same  events,  risky 
and  multiattribute  choice  alternatives  can  be  described  as 
n x m matrices: 


10 


EVENTS 


E1 

E2  

» • • • E . •••< 

D 

. . . . E 

m 

X,  , 

Y 

11 

X12 

X]_  j * * * ' 

lm 

X-  . 

V 

y 

, . . . V 

21 

22  

x2j  

x2m 

The  a^'s  are  the  attributes  under  consideration,  the  E ^ ' s 
are  the  events  that  determine  which  multiattributed  outcome 
the  decision  maker  will  receive,  and  the  x. .'s  are  amounts 
(or  other  real  valued  descriptions)  in  attribute  i of  the 
outcome  one  receives  if  event  j occurs.  The  multiattribute 
risky  decision  paradigm  then  reduces  to  the  study  of  choices 
among  matrices  of  the  above  form.  The  set  of  matrices  that 
is  available  to  the  decision  maker  at  the  time  he  has  to 
make  his  selection (s)  is  called  the  choice  set,  labelled  X, 
with  typical  elements  x,  y,  z.  The  i-th  row  vector,  x^,  is 
the  gamble  within  the  i-th  attribute.  The  j-th  column 
vector,  x j , is  the  multiattribute  outcome  that  the  decision 
maker  receives  if  event  j occurs.  In  the  following,  matrix 


elements  x^j  and  y^j  will  be  used  to  distinguish  elements  of 
two  choice  alternatives  that  are  different.  If  elements  are 
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constant  across  choice  alternatives,  they  will  be  labelled 
a^j  or  and  subscripts  will  be  dropped,  if  elements  are 

equal  within  a choice  alternative. 

Models  and  Independence  Assumptions 

Preference  models  are  numerical  descriptions  of  a deci- 
sion maker's  preferences.  They  assign  numbers  to  the  elements 
in  the  choice  set  X such  that  the  numerical  order  coincides 
with  the  decision  maker's  natural  preference  order  among  ele- 
ments in  X,  and  such  that  some  numerical  properties  reflect 
other  qualitative  properties  of  the  decision  maker's  prefer- 
ence order.  All  preference  models  for  risky  multiattribute 
preferences  that  will  be  described  in  the  following  para- 
graphs are  based  on  the  fundamental  weak  order  model.  The 
weak  order  model  assumes  that  there  exists  a function  u from 
the  choice  set  X into  the  real  numbers  R such  that  the  numeri- 
cal values  that  u assigns  to  elements  in  X preserve  the  order 
of  the  decision  maker's  preferences.  Formally,  this  weak 
order  model  has  the  following  representation: 

Weak  Order  Model 

There  exists  a real  valued  function  u 
u : X R 
such  that 

x i y 

if  and  only  if 
u(x)  > u (y) 
for  all  x,  y e X, 
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where  "x  > y"  means  "y  is  not  preferred  to  x",  and  ">"  is 

'V  — 

the  usual  order  relation  among  the  real  numbers. 

Besides  some  technical  assumptions,  the  main  behavioral 
condition  on  the  preferences  relation  > is  transitivity  of 
preferences,  formally  stated  as: 

Transitivity 


if 

x > 

and 

y > 
■*  % 

then 

X > 

'V, 

for  all  x. 

Yr 

Beyond  the  weak  order  model  for  preferences  among  risky  and 
multiattribute  decisions,  scores  of  models  exist  that  decom- 
pose u into  an  aggregate  of  functions  of  subsets  of  events 
and/or  attributes.  The  behavioral  axioms  that  distinguish 
these  models  are  called  independence  assumptions.  Indepen- 
dence assumptions  express  in  terms  of  the  preference 
relation  which  manipulations  of  elements  in  X leave  their 
preference  ordering  unaffected.  For  example,  a very  weak 
independence  assumption  could  require  that  deleting  an  ele- 
ment from  a subset  S of  X does  not  change  the  preference 
ordering  of  the  remaining  elements  in  S.  A much  stronger 
independence  assumption  could  postulate  that  preferences 
among  elements  in  X be  unaffected  by  scalar  multiplication 
of  row  or  column  vectors. 
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Two  main  model  classes  can  be  distinguished  on  the 
basis  of  independence  assumptions  called  "row  independence" 
and  "column  independence."  According  to  row  independence, 
preferences  among  elements  in  X that  vary  only  in  some  rows 
are  independent  of  the  particular  values  in  the  remaining 
(constant)  rows.  Or  to  put  it  another  way,  preferences  are 
determined  solely  by  the  marginal  (single  attribute)  proba- 
bility distributions.  Row  independence  can  be  formulated 
as  follows: 


Row  Independence 


a . 
x 


n 


> 

% 


1 

i>* 

i — 

*1 

*1 

X2 

?2 

• 

• 

if  and 

• 

• 

• 

• 

• 

• 

> 

•i 

• 

only  if 

b. 

l 

• 

• 

b. 

l 

• 

• 

• 

A 

• 

X 

n 

• 

A 

for  all  x,  y e X,  a^ , b^ 


x 1,2, ... ,n. 


The  fact  that  this  formulation  of  row  independence  only 
uses  one  constant  context  row  a is  of  no  importance,  since 
all  cases  in  which  preferences  are  conditional  on  more  than 
one  constant  row  are  implied. 
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Symmetrically , column  independence  requires  that  pref- 
erences among  elements  in  X that  vary  only  in  some  columns 
be  independent  of  the  particular  values  in  the  other  constant 
columns.  In  other  words,  column  independence  assumes  that 
preferences  among  elements  in  X are  dependent  only  on  those 
events  in  which  these  elements  vary,  and  not  on  those  events 
in  which  they  have  identical  outcomes.  This  assumption  is 
the  usual  sure  thing  assumption  in  expectation  models,  here 
applied  to  multiattributed  outcomes.  Formally  column  inde- 
pendence can  be  stated  as  follows: 


Column  Independence 


(x1,x2. 


■Sj V K (— 1 2 ■ 


(x1,x2, ,bj  , 


if  and  only  if 

'2n>  K 


for  all  x,  y e X,  a ^ , b ^ , and 

^ 1,2,..., m . 


Again  the  fact  that  the  j-th  column  was  singled  out  as  a 
conditioning  column  is  of  no  importance  for  the  formulation 
of  column  independence,  which  applied  to  any  combination  of 
conditioning  columns. 

If  neither  row  nor  column  independence  holds,  no  simple 
decomposition  of  u into  single  attribute  or  event  functions 
is  possible.  If  row  independence  holds,  but  column  inde- 
pendence fails,  u can  be  expressed  as  an  additive  composition 
of  single  row  (single  attribute)  functions  v^.  This  model 
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will  be  called  Model  1,  and  it  can  be  formally  stated  as 


follows : 


Model  1 


u(x)  = l v. (x, ) 
i=l  1 1 


For  example.  Model  1 would  be  an  accurate  description  of  a 
decision  maker's  preferences  if  he  ordered  elements  in  X 
purely  on  the  basis  of  their  single  attribute  certainty 
equivalents.  Model  1 has  not  yet  been  studied  in  the  theo- 
retical literature,  nor  have  its  theoretical  implications 
been  tested.  However,  Model  1 has  been  analyzed  in  connec- 
tion with  expectation  models  that  are  based  on  the  column 
independence  assumption. 

When  outcomes  are  multiattributed  column  independence 
is  a necessary  condition  for  all  models  that  express  utili- 
ties of  elements  in  X as  an  expected  value  of  single  outcome 
utilities.  When  applied  to  multiattributed  outcomes,  all 
expectation  models  assume  that  preferences  are  column 
independent.  Models  that  are  based  on  column  independence 
can  be  formalized  by  the  following  additive  combination  of 
single  event  utility  functions  u^: 


Model  2 


ill 

u (x)  = l u . (x  . ) . 

j=l  3 -3 


The  SEU  model  (see  Savage,  1954;  Luce  and  Krantz,  1971)  and 
all  other  expectation  models  are  special  versions  of  Model 
2,  in  which  u^tx^)  = CjU(x^);  c^  > 0: 


^ ; 


I 
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m 

Model  2 (SEU)  u(x)  = J p{E^)*u(x^). 

j=i  J 

The  additional  assumption  of  the  SEU  model  is  a very  natural 
condition,  since  it  is  unlikely  that  the  shape  of  the 
utility  functions  u^  should  depend  on  the  event  E ^ . Note, 
however,  that  a similar  strengthening  of  Model  1 would  not 
be  reasonable,  since  there  is  little  theoretical  basis 
for  assuming  that  single  attribute  utility  functions  have 
identical  shapes. 

If  preferences  satisfy  both  row  and  column  independence 
they  can  be  described  by  Model  3,  thac.  combines  the  SEU 
version  of  Model  2 with  the  additive  row  Model  1: 

m n 

Model  3 u(x)  = Y p (E . ) • y u.  (x^). 

j=l  11  i=l  J 

This  model  is  the  special  case  of  Model  2 where  u(x^)  is  an 
additive  function  of  single  attribute  utility  functions 

ui <xij>- 

Model  3 was  first  formulated  by  Fishburn  (1965)  and 
Poliak  (1967).  It  was  subsequently  generalized  by  Fishburn 
(1970),  and  by  Fishburn  and  Keeney  (1974).  But  Fishburn 
arrived  at  Model  3 not  by  assuming  a combination  of  row  and 
column  independence,  but  by  a slightly  different  path.  His 
theory  postulated  that  a utility  function  over  elements  in 
X satisfy  the  SEU  model.  By  adding  assumptions  that  are 
special  versions  of  row  independence.  Model  3 could  then  be 
proven.  Later  research  by  Keeney  (1968,  1974),  Fishburn 
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(1973,  1974),  and  Farquhar  (1974)  used  this  route  to  develop 
some  weaker  decomposition  forms  for  u(Xj). 

To  explain  this  line  of  thought,  consider  that  column 
independence  is  valid  in  its  SEU-version.  Also  consider  the 
case  in  which  row  independence  in  its  general  form  is  violated. 
There  are  various  weaker  assumptions  about  preferences  that 
lead  to  decompositions  of  the  expected  utility  function  u in 
Model  2. 


t 


One  such  assumption  is  called  utility  independence, 
formulated  by  Poliak  (1967)  and  Keeney  (1968,  1974).  As  a 
special  case  of  row  independence,  utility  independence  is 
weaker  than  the  row  independence  assumption  formulated  above. 
Utility  independence  requires  row  independence  to  hold  only 
if  the  conditioning  row  that  is  identical  across  the  com- 
pared matrices  has  equal  elements  in  all  events.  Formally 
utility  independence  can  be  stated  as  follows: 


Utility  Independence 


— — 

- — 

- - 

*1 

*1 

*1 

?1 

x2 

y 2 

X2 

*2 

• 

• 

• 

• 

• 

> 

if  and 

• 

> 

• 

• 

_ 

'Xj 

_ 

_ 

0/ 

a . 

l 

a . 
i 

only  if 

b. 

l 

b. 

X 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

X 

y 

X 

y 

n 

Jn 
_ 

n 

Jn 

for  all  x,  y e X,  a^,  b^, 
a^j  = const,  b^j  = const 


|'j 

i 

j 

. \ 


1 1 


■i 

1 


X 1,2,. «.,n,  j 1,2,..., m . 
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If  the  decision  maker's  preferences  satisfy  utility 
independence,  they  can  be  described  by  a multiplicative 
model : 

m n 

Model  4 1 + ku(x)  = J p(E . ) • n [l  + ku^  (x. ..  )1 . 

j=l  3 i=l  1 13 

If  utility  independence  holds,  another  weak  version  of 
row  independence  called  marginality  (Raiffa,  1969)  , value 
independence  (Pishburn  and  Keeney,  1974)  , or  additive  inde- 
pendence (Keeney  and  Raiffa,  in  press)  can  prove  Model  3. 
Marginality  requires  row  independence  to  hold  only  for  gam- 
bles that  have  equally  likely  events  E^.  That  is  when 
p (E j ) = 1/m.  Marginality  postulates  that  a decision  maker 
be  indifferent  among  gambles  with  equally  likely  events  if 
they  can  be  made  identical  by  exchanging  elements  x^j  within 
rows.  In  other  words,  permuting  row  elements  in  x should  not 
change  its  utility  to  the  decision  maker.  Formally,  margin- 
ality can  be  stated  as  follows: 


Marginality 


E1 

• • • E i 
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• • • E 
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E1 
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...  E 
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Although  there  are  many  other  forms  of  models  that  apply 
to  the  risky  multiattribute  decision  making  situation,  Models 
1 and  2 are  in  a sense  the  most  fundamental  ones.  All  other 
decomposition  forms  that  have  been  developed  theoretically  so 
far  are  special  cases  of  the  SEU  Model  2 in  which  indepen- 
dence assumptions  imply  certain  decompositions  of  the  single 
outcome  utility  function  u(Xj).  Of  these  models  the  additive 
and  multiplicative  Models  3 and  4 are  by  far  the  most  promi- 
nent ones,  and  have  been  thoroughly  studied  in  theory  and 
applications.  Model  1 itself  has  not  received  much  theoreti- 
cal attention,  nor  has  it  been  applied  in  many  real  world 
decision  contexts. 

Figure  1 gives  an  overview  of  the  most  important  inde- 
pendence assumptions  and  models  for  risky  multiattribute 
preferences.  Models  and  independence  assumptions  are  orga- 
nized in  a flow  chart,  indicating  which  independence 
assumptions  have  to  be  satisfied  for  a model  to  be  valid. 
Diamonds  stand  for  independence  assumptions  (tests) , and 
rectangles  stand  for  models. 
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Figure  1 

Independence  Assumptions  and  Models 
for  Risky  Multiattribute  Preferences 


if  Uj(Xj)  = c^»u(x^)  for  all  j. 


if 
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ability  of  decomposition  Models  1-4  to  describe  human 
preferences.  Merely  four  experiments  have  any  implications 
for  a distinction  between  these  models:  Tversky  (1967), 
v.  Winterfeldt  (1971) , Fischer  (1972) , and  Delbeke  and 
Fauville  (1974) . 

Tversky  analyzed  subjects'  preferences  for  gambles  that 

I, 

had  two  outcomes,  cigarettes  and  candy.  Subjects  expressed 
their  preferences  by  stating  selling  prices  for  several  com- 
binations of  cigarettes  and  candy,  as  well  as  for  several  ' 

gambles  for  either  outcome.  The  results,  as  far  as  they 
bear  on  a distinction  between  risky  and  multiattribute  pref- 
erences  models,  are: 

1)  An  additive  model  could  describe  the  subjects' 
riskless  preferences  for  combinations  of  cigarettes 
and  candy  very  well. 

2)  If  one  assumed  that  riskless  utility  functions 
derived  from  the  subjects’  preferences  among  sure 
commodity  bundles  were  identical  up  to  a linear 
transformation  to  risky  utility  functions  derived 
from  subjects'  preferences  among  gambles  for  either 
commodity,  then  subjective  probabilities  did  not 
sum  to  unity.  If,  on  the  other  hand,  one  assumed 
that  probabilities  summed  to  1,  then  one  had  to 
acknowledge  that  risky  and  riskless  utility  func- 
tions deviated  consistently. 
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3)  In  spite  of  these  apparent  violations  of  SEU,  the 
overall  predictions  of  an  additive  SEU  model  in 
form  of  Model  2 fit  the  bids  for  gambles  with  both 
commodities  as  outcomes  extremely  well. 

When  Tversky  conducted  his  experiment,  models  such  as 
Models  3 and  4 were  not  known  in  their  precise  axiomatic 
foundation  presented  here.  In  the  light  of  these  axioms, 
Tversky' s results  could  be  interpreted  differently  today. 

For  example,  in  risky  multiattribute  preferences  there  is 
absolutely  no  need  for  risky  and  riskless  utility  functions 
to  coincide.  Consequently,  the  difference  between  risky  and 
riskless  utility  functions  that  Tversky  detected  may  not  be 
due  to  a violation  of  SEU,  but  rather  to  a violation  of  addi- 
tivity of  risky  utility  functions  in  Model  3.  if  one  assumes 
a non-additive  model,  as  in  Model  4,  that  difference  appears 
quite  natural  even  in  the  presence  of  the  SEU  model.  The 
more  recent  risky  multiattribute  models  and  independence 
assumptions  formulate  the  real  differences  between  the  models 
more  precisely  and  lend  themselves  to  experimental  tests. 

v.  Winterfeldt  (1971)  directly  analyzed  such  indepen- 
dence assumptions.  When  subjects  were  asked  to  state  their 
preferences  among  apartments  that  varied  only  on  two  attri- 
butes (distance  from  campus  and  transportation  facilities) , 
their  responses  supported  a riskless  additive  multiattribute 
model.  But  when  asked  to  state  preferences  among  gambles 
for  such  apartments  that  were  designed  according  to  the 
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marginality  assumption,  subjects  violated  the  risky  additive 
Model  3 systematically.  In  spite  of  such  systematic  viola- 
tions of  marginality,  an  additive  combination  of  utility 
functions  u^  in  the  form  of  Model  3 predicted  subjects'  pref- 
erences among  apartments  very  well. 

The  third  experiment  that  studied  risky  and  multiattri- 
bute preferences  was  performed  by  Fischer  (1972).  Fischer 
analyzed  subjects'  preferences  among  used  cars  by  determining 
their  global  preferences  for  gambles  with  used  cars  as  out- 
comes. He  found  that  the  utility  function  u(x)  that  was 
constructed  from  subjects'  risky  preferences  over  gambles 
with  used  cars  as  outcomes  deviated  significantly  from 
additivity,  indicating  a violation  of  Model  3.  However,  as 
in  v.  Winterfeldt ' s experiment,  Model  3 generated  a utility 
function  that  agreed  well  with  the  utility  function  derived 
from  subjects'  global  preferences. 

Delbeke  and  Fauville  (1974)  conducted  the  most  recent 
experiment.  The  explicit  purpose  of  their  study  was  to 
test  Fishburn's  marginality  assumption.  Subjects  had  to 
state  their  preferences  among  marginally  equivalent  gambles 
for  various  two-attribute  stimuli,  ranging  from  vacation 
trips  that  varied  on  the  attributes  location  and  mode  of 
transportation,  to  combinations  of  job  types  and  salaries. 
Although  the  small  number  of  subjects  and  stimuli  do  not 
allow  conclusive  interpretations  of  their  results,  Delbeke 
and  Fauville  identified  some  violations  of  marginality,  even 
if  riskless  preferences  were  additive. 
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The  message  that  these  experiments  convey  seems  con- 
tradictory: in  spite  of  obvious  model  violations  (tests  of 

marginality  and  tests  of  risky  additivity  failed) , additive 
models  such  as  Model  3 predict  subjects'  preferences  and 
utility  judgments  very  well.  This  is  the  same  result  that 
was  encountered  previously  for  the  simpler  riskless  multi- 
attribute models  and  risky  single  attribute  models.  The 
quality  in  prediction  of  global  preference  behavior  may 
similarly  be  interpreted  in  terms  of  the  robustness  of  simple 
additive  models,  which  describe  preferences  very  well,  even 
if  these  preferences  are  not  additive  and  violate  model 
assumptions. 

Results  of  further  global  model  validations  seem,  there- 
fore, rather  predictable,  no  matter  what  the  type  of  stimulus 
or  the  personal  characteristics  of  the  subjects  may  be. 

Simple  additive  expectation  models  will  describe  subjects' 
overall  preferences  rather  well,  although  they  are  obviously 
wrong  in  some  predictions.  The  more  interesting  questions 
that  remain  are:  where  are  models  likely  to  fail,  and  for 
what  reason?  For  example,  are  Tversky's  results  due  to  a 
genuine  failure  of  the  SEU  model,  or  are  they  a consequence 
of  the  non-additive  risky  utility  functions?  What  is  the 
relative  degree  of  violations  of  row  and  column  independence 
in  Models  1 and  2?  How  do  these  violations  depend  on  the 
stimulus  context?  To  what  degree  are  violations  dependent 
on  instructions  and  response  modes? 
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Knowing  the  answers  to  these  questions  not  only  will 
help  in  understanding  and  characterizing  human  decision 
making  behavior  under  risk  and  multiple  attributes  of  out- 
comes, but  may  also  improve  models  and  assessment  methods 
that  are  used  as  aids  in  real  world  decision  problems.  The 
specific  purpose  of  the  present  experiment  was  to  test  the 
independence  assumptions — row  and  column  independence,  util- 
ity independence,  and  marginality — that  were  described  in 
the  previous  section.  The  experiment  was  designed  to 
establish  the  degree,  the  form,  and  the  stability  of  viola- 
tions of  row  and  column  independence  in  risky  multiattribute 
preferences,  and  to  determine  whether  such  violations  depend 
on  personal  preference  characteristics  of  the  decision  maker. 


EXPERIMENTAL  ANALYSIS 
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Experimental  Questions  and  Hypotheses 

Four  specific  questions  were  singled  out  for  an  experi- 
mental analysis  of  the  four  independence  assumptions 
discussed  above: 

1)  Is  there  any  difference  in  the  degree  of  violation 
between  row  and  column  independence?  The  hypoth- 
esis was  that  row  independence  will  be  more 
strongly  violated  than  column  independence  since 
subjects  may  show  variance  preferences  for  gambles 
that  have  identical  marginal  probability  distri- 
butions. Column  independence  appears  to  have  a 
firmer  foundation  in  both  theory  and  the  experi- 
mental results  that  support  expectation  models  for 
single  and  multiattributed  outcomes.  To  test  the 

I 

relative  degree  of  independence  violations,  sym- 
metric tests  for  row  and  column  independence  with 
special  marginality  and  utility  independence  forms 
were  developed  for  the  experiment. 

2)  What  is  the  form  of  column  or  row  independence  vio- 

. 

lations,  and  to  what  degree  do  these  violations 
depend  on  stimulus  subsets?  In  particular,  the 
question  arises  how  the  violations  of  independence 
assumptions  are  related  to  the  context  rows  or 
columns.  Understanding  the  effect  of  these 
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context  rows  or  columns  is  a key  to  understanding 
the  characteristics  or  risky  multiattribute  pref- 
erences. To  establish  the  form  of  independence 
violations,  stimulus  subsets  were  selected  to  mag- 
nify possible  context  effects. 

3)  How  stable  are  violations  of  row  and  column  inde- 
pendence? Possible  violations  are  likely  to  depend 
on  response  modes  and  instructions.  In  Tversky's 
experiment,  for  example,  the  selling  price  response 
mode  may  have  forced  subjects  to  think  in  terms  of 
exchange  values  and  respond  in  terms  of  additive 
cash  equivalents,  when  asked  for  selling  prices  for 
two  commodities.  To  test  the  dependence  of  inde- 
pendence violations  on  response  modes,  three 
different  response  modes  were  selected  for  the 
experimental  tests:  preferences,  ratings,  and 
cash  equivalent  assessments  (selling  prices) . 

4)  Are  violations  of  row  or  column  independence  re- 
lated to  other  personal  preference  characteristics 
such  as  risk  attitude,  marginal  utilities  in  the 
single  attributes,  or  additivity/non-additivity  of 
riskless  perferences?  To  investigate  the  relation- 
ship between  such  personal  preference  characteris- 
tics and  independence  violations,  various  indices 
of  risk  attitude  were  assessed  and  an  index  for 
additivity  was  determined. 
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Method 


Stimuli . One  of  the  major  problems  of  any  experiment 
that  tries  to  learn  about  the  structure  of  human  preferences 
is  the  selection  of  appropriate  stimuli.  Preferences  often 
depend  very  much  on  the  physical  properties  of  stimuli,  often 
more  than  on  the  personal  preference  characteristics  of  the 
subject/decision  maker.  A good  example  are  preferences  among 
seats  in  a football  stadium.  There  exists  a physical  measure 
of  "visibility"  that  is  probably  a complex,  but  non-additive 
function  of  row  and  gate  numbers.  Preferences  for  most 
visitors  will  differ  little  from  that  physical  measure.  If 
one  wants  to  learn  about  the  subjective  nature  of  preferences, 
seats  in  a football  stadium  would  therefore  not  be  very 
interesting,  since  preferences  among  seats  could  be  modelled 
without  using  any  subjective  inputs.  All  one  may  learn  from 
observing  preferences  among  seats  in  a football  stadium  is 
how  well  a subject  can  estimate  that  physical  measure  of 
visibility. 

On  the  other  extreme,  some  stimuli  have  so  little 
physical  structure  that  preferences  are  very  idiosyncratic. 

An  example  is  preferences  among  combinations  of  coffee, 
sugar,  and  milk.  Few  subjects  will  agree  on  a "best"  mix, 
and  few  will  exhibit  similar  combination  rules.  Such  stimuli 
may,  in  fact,  be  interesting  to  study  experimentally,  but 
there  is  little  hope  to  ever  generalize  across  subjects  in 
statements  about  model  validity. 


In  principle,  any  study  of  human  preferences  is  sub- 
jective and  can  make  statements  only  about  individual 
results.  But  there  are  differences  in  the  degree  of  gener- 
alization depending  on  the  nature  of  the  stimuli.  In  the 
football  stadium  case,  one  could  probably  generalize  from  a 
pool  of  subjects  to  the  preferences  of  many  (which  one  could 
have  without  experimentation);  in  the  coffee  case,  it  would 
be  very  difficult  to  say  anything  beyond  individual  pref- 
erence structures.  In  the  design  of  a study  of  subjective 
preference  properties  like  the  ones  this  experiment  studies, 
one  would  like  to  be  able  to  make  intersubject  statements 
about  possible  degrees  and  forms  of  model  violations,  while 
still  examining  a basically  subjective  process.  The  selec- 
tion of  stimuli  determines  to  which  degree  such  balance  is 
possible. 

One  way  to  achieve  the  right  balance  is  by  studying 
many  different  stimuli  and  to  try  to  abstract  those  aspects 
of  preferences  that  are  independent  of  the  actual  stimulus, 
or  that  are  functions  of  interactions  between  the  stimuli 
and  preferences.  Unfortunately,  at  the  present  stage  of 
knowledge  about  risky  and  multiattribute  preferences,  such 
an  approach  would  be  very  uneconomical,  since  the  experi- 
mental questions  have  not  been  phrased  precisely  enough. 

An  alternative  is  to  select  a prototypical  stimulus  with 
some  well-known  physical  characteristics  that  still  leave 
room  for  the  manifestation  of  subjective  preferences.  Since 
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experimental  results  should  be  generated  by  the  subjective 
nature  of  preferences,  the  physical  nature  of  such  stimulus 
should  be  amendable  to  various  forms  of  risky  multiattribute 
models  to  avoid  preclusion  of  experimental  results. 

For  the  present  experiment,  this  latter  approach  was 
chosen  mainly  for  economical  reasons.  The  stimuli  that  were 
finally  selected  for  the  experiment  are  even  chance  gambles 
for  market  baskets  containing  certain  amounts  of  gasoline 
and  ground  beef.  A pilot  study  revealed  for  most  subjects 
strong  deviations  from  an  additive  cash  equivalent  model 
(in  which  subjects  simply  add  the  market  price  of  both 
commodities) , marginally  decreasing  utilities  for  beef, 
and  somewhat  less  marginally  decreasing  utilities  for 
gasoline.  All  of  these  properties  are  compatible  with 
Models  1-4. 

Stimuli  were  presented  to  subjects  in  a matrix  format. 
Figure  2 gives  an  example.  The  choice  set  X in  this  experi- 
ment existed,  therefore,  of  2 x 2 matrices  with  cell  elements 
representing  certain  amounts  of  gasoline  and  ground  beef. 

To  test  row  and  column  independence,  certain  special 
stimulus  forms  had  to  be  created.  The  general  idea  of  perform- 
ing these  tests  was  to  first  establish  an  indifference 
between  two  gambles  in  one  constant  row  or  column  context, 
and  then  to  check  if  that  indifference  held  up  under  a dif- 
ferent row  or  column  context.  For  example,  to  test  row 
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Figure  2 
Example  Stimuli 

Take  a careful  look  at  the  following  two  gambles.  Assume 
that  you  have  the  right  to  play  one  of  these  gambles  once. 


GAMBLE  1 


if  the  outcome 
of  a coin  flip 
is 


GAMBLE  2 


if  the  outcome 
of  a coin  flip 
is 


' 


I 


HEADS  TAILS 


HEADS  TAILS 


GALLONS  OF 
GASOLINE 


you  win 


or 


or 


16 

and 

I 

0 


8 

and 

10 


you  win 


0 

and 

10 


or 


16 

and 


10 


GALLONS  OF 
GASOLINE 


POUNDS  OF 
GROUND  BEEF 


or 


POUNDS  OF 
GROUND  BEEF 


independence,  two  row  stimuli  would  be  matched  by  the  sub- 
ject, by  varying  the  amount  x in  one  row: 


Gamble  1 
Heads  Tails 


Gamble  2 
Heads  Tails 


Gasoline  (10 


0 ) ^ (x 


Then  a row  independent  subject  should  be  indifferent  between 
the  following  two  check  stimuli: 


Gamble  1 
Heads  Tails 


Gamble  2 
Heads  Tails 


Beef 


no  matter  what  the  constant  amounts  a and  b in  the  context 
row  are. 

Similarly,  to  test  column  independence,  first  two 
columns  are  matched  by  the  subject,  e.g., 


Beef 


Bundle  1 

(:) 


Bundle  2 


Then  a column  independent  subject  should  be  indifferent 
between  the  following  two  check  stimuli: 


Gamble  1 
Heads  Tails 


Gamble  2 
Heads  Tails 


Beef 


independently  of  the  amounts  a and  b in  the  context  column. 
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To  construct  check  stimuli  in  this  "match  and  check" 
procedure,  two  similar  matches  were  selected  for  row  and 
column  tests,  one  match  labelled  EE  (for  extreme-extreme)  and 
one  labelled  EM  (for  extreme-middle) . The  EE  matches  involve 
a substantial  stimulus  change  or  trade-off,  while  the  EM 
matches  require  less  trade-off.  Table  1 lists  these  four 
matches.  One  additional  EM  match  was  created  to  futher  test 
row  independence  in  its  utility  independence  form. 


Table  1 

List  of  Matches  for  the  Match  and  Check  Procedure 


Extreme - 
extreme 


Extreme- 

middle 


Row  Matches 


H T H T 


Column  Matches 


Gas  (16  0)  % (0  16) 


H T H T 

Gas  (16  0)  ^ (x  x) 


Beef 


Beef 


sure  sure 

o ~ (:) 


sure  sure 

C) ' G) 


Extreme- 

middle 


H T H T 

Beef  (10  0)  (x  x) 


It  was  not  possible  to  construct  row  and  column  matches 
in  perfect  symmetry.  One  reason  was  that  the  EE  match  for 
rows  follows  trivially  from  the  equal  likelihood  of  the 
events  Heads  and  Tails,  while  the  EE  match  for  columns  needs 


to  be  assessed  by  the  subject.  Another  reason  was  that  the 
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EM  match  in  rows  has  equal  amounts,  while  the  EM  match  for 
columns  did  not  necessarily  have  that  property.  Neverthe- 
less, the  matches  were  created  as  symmetrically  as  possible. 

The  five  matches  in  Table  1 were  coupled  with  nine 
possible  context  rows  or  columns  that  were  generated  from 
combinations  of  the  following  single  commodity  values: 


Gasoline 


0,  8,  16  Gallons 


Ground  Beef:  0,  5,  10  Pounds. 


Forty-five  check  items  that  included  pairs  of  gambles  that 
should  be  indifferent  according  to  either  row  or  column 
independence  were  created  this  way.  The  following  examples 
show  how  these  check  items  can  test  row  independence,  utility 
independence,  marginality,  as  well  as  similar  forms  of  column 
independence. 

The  EE  row  match  coupled  with  its  context  rows  tests  the 
marginality  assumption  nine  times: 


Gamble  1 
Heads  Tails 


Gamble  2 
Heads  Tails 


Beef 


for  a,  b = 0,  5,  10. 


The  EM  row  matches  coupled  with  their  context  rows  test 
row  independence  in  its  pure  form  and  utility  independence. 
In  the  gasoline  match,  for  example,  check  items  are  of  the 


following  type: 
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Gas 
Beef 

for  a,  b = 0,  5,  10. 

If  a = b,  these  check  items  test  utility  independence. 
The  remaining  check  items  test  row  independence  neither  in 
its  marginality  nor  in  its  utility  independence  form. 

Similar  tests  are  generated  for  the  additional  EM  row  matches 
in  the  beef  commodity: 


Gamble  1 
Heads  Tails 
"16  0 


Gamble  2 
Heads  Tails 
x x 


r16  °i  . r*  -l 

La  b J La  b_ 


Gas 

Beef 


Gamble  1 
Heads  Tails 


% 


Gamble  2 
Heads  Tails 


The  column  check  items  are  very  similar  in  form  to  the 
row  check  items.  The  EE  matches  create  column  check  items 
that  correspond  to  the  marginality  test: 

Gamble  1 Gamble  2 

Heads  Tails  Heads  Tails 


Gas 

" 0 

a 

% 

X 

a 

Beef 

10 

b 

0 

b 

for  a = 0,  8,  16;  and  b = 0,  5,  10. 

The  other  nine  check  column  items  are: 


Gamble  1 Gamble  2 

Heads  Tails  Heads  Tails 


Gas 

0 a 

x a 

Beef 

O 

tr 

I 

5 b 

for  a = 0,  8,  16;  and  b = 0,  5,  10. 

Subjects . Subjects  v/ere  18  upper  level  students  (mostly 
graduates)  and  staff  members  from  the  University  of  Southern 
California.  All  subjects  could  use  gasoline  and  ground  beef 
for  consumption.  They  were  paid  a flat  fee  of  $12  for  a 
total  of  3-5  hours  of  participation  in  the  experiment.  In 
addition,  they  had  a chance  to  win  up  to  16  gallons  of  gaso- 
line, 10  pounds  of  ground  beef,  or  $20  in  gambles  that  were 
actually  played  after  the  experiment  in  accordance  with  their 
preferences. 


Instructions  and  Response  Modes.  Subjects  were  pre- 
sented with  the  45  check  stimuli  twice.  In  the  first  run 
they  did  not  receive  any  special  instructions.  In  the  second 
run  six  subjects  repeated  the  first  run,  six  were  cued  to 
compare  the  gambles  in  a check  stimulus  by  comparing  their 
rows  first,  and  six  were  cued  to  compare  their  columns  first. 
To  force  subjects  to  attend  to  rows  (columns)  when  comparing 
gambles  in  check  items,  the  rows  or  columns  that  constituted 
the  pair  of  gambles  were  presented  to  them  on  a page  to  the 
left  of  the  check  item  page  and  they  were  asked  to  mark  their 
preferences  among  these  rows  (columns)  first,  and  then  to 
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look  at  the  check  item.  They  were  told  to  carefully  consider 
their  preferences  among  the  rows  (columns)  when  making  their 
preference  judgments  about  the  combined  gambles. 

To  quantify  the  subjects'  preferences,  and  to  determine 
if  response  modes  influenced  the  violations  of  independence 
assumptions,  three  response  modes  were  used:  simple  pref- 
erences or  indifferences,  a rating  of  the  strength  of 
preferences,  and  assessments  of  cash  equivalents  for  both 
gambles  in  each  check  item.  In  the  first  response  mode  sub- 
jects simply  had  to  mark  the  preferred  gamble  in  each  check 
item.  In  the  second  response  mode  they  marked  the  strength 
of  their  preference  for  the  preferred  gamble  on  a continuous 
scale  that  varied  from  0 (indifference)  to  100  (very  strong 
preference) . In  the  final  response  mode  subjects  were  asked 
to  independently  assess  for  each  gamble  of  a check  stimulus 
an  amount  of  money  such  that  they  would  be  indifferent  be- 
tween playing  the  gamble  or  receiving  the  sure  amount  of 
money  as  a gift.  To  reinforce  subjects  to  state  their  true 
indifference  point  Marschak's  bidding  procedure  was  applied 
and  it  was  explained  to  subjects  that  some  of  the  gambles 
would  be  played  after  the  experiment  according  to  Marschak's 
procedure  (see  Becker,  DeGroot,  and  Marschak,  1964) . Figure 
3 gives  an  example  of  a response  sheet  to  one  check  item. 

Procedure.  Subjects  were  run  individually  in  two 
sessions.  The  first  session  lasted  approximately  2-3  hours. 
The  second  session  lasted  between  1%  to  2 hours.  At  the 
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1.  MARK  THE  GAMBLE  THAT  YOU  WOULD  PREFER  TO  PLAY  IN  THE  SPACE 
BELOW.  (If  you  are  indifferent,  mark  both  spaces.) 


INDICATE  ON  THE  FOLLOWING  SCALE  HOW  MUCH  YOU  PREFER  THAT 
GAMBLE  RELATIVE  TO  THE  OTHER  ONE.  (0  means  "no  pref- 
erence"? 100  means  "very  strong  preference".) 


STATE  YOUR  CASH  EQUIVALENTS  FOR  BOTH  GAMBLES  IN  THE  SPACES 
BELOW.  (Remember  to  check  if  you  are  really  indifferent 
between  playing  the  gamble  or  receiving  your  cash  equiva- 
lent as  a gift.) 
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beginning  of  the  experiment  subjects  received  a detailed  15 
page  instruction  booklet,  which  contained  a brief  introduction 
to  the  experiment,  examples  of  the  types  of  questions  that 
would  be  asked,  and  a description  of  the  experimental  proce- 
dure. Subjects  were  instructed  that  their  task  was  to  express 
their  preferences  among  a series  of  pairs  of  gambles  and  to 
make  indifference  judgments  about  them.  They  were  told  that 
some  of  these  gambles  would  be  played  after  the  experiment 
according  to  their  judgments  and  preferences,  and  that  they 
could  win  up  to  16  gallons  of  gasoline,  10  pounds  of  ground 
beef,  or  $20  if  they  expressed  their  true  preferences.  Sub- 
jects were  warned  that  simplifying  strategies  could  hurt 
their  prospective  winnings  if  they  did  not  coincide  with 
their  true  preferences.  Besides  these  general  instructions, 
several  examples  for  stimuli  and  response  modes  were  pre- 
sented and  explained  in  the  booklet.  After  the  subjects 
read  the  instructions,  the  experimenter  made  sure  that  they 
understood  them,  and  again  warned  them  to  express  their  true 
preferences.  Marschak's  bidding  procedure  was  explained 
verbally  once  more  in  two  examples  to  make  sure  that  all 
subjects  understood  its  meaning. 

Subsequently  subjects  were  presented  with  training 
booklet  I,  which  contained  several  example  problems,  assess- 
ments of  certainty  equivalents  for  gas  and  beef  gambles,  and 
assessments  of  various  cash  equivalents  for  complex  gambles. 
All  cash  equivalents  for  the  bundles  including  the  amounts 
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in  on  a range  in  which  subjects  switched  their  preference. 
This  fact  was  pointed  out  to  subjects,  and  they  were  asked 
to  set  their  indifference  point.  At  each  stage  the  experi- 
menter encouraged  subjects  to  revise  their  previous  answers, 
if  they  felt  that  they  did  not  reflect  their  thinking  any 
more. 

The  first  session  was  concluded  with  the  presentation 
of  three  booklets  containing  the  first  45  check  items.  The 
order  of  check  items  and  the  position  of  the  gambles  in  a 
check  item  were  randomized. 

The  second  session  followed  within  two  days  after  the 
first.  Subjects  were  presented  with  the  same  45  check  items 
in  the  same  order  as  in  the  first  session.  Six  subjects 
received  no  special  instructions,  six  received  row  instruc- 
tions, and  six  received  column  instructions. 

Results 

Data  Description  and  Outline  of  Analyses.  Each  of  the 
18  subjects  was  presented  with  a total  of  90  check  items 
(5  matches,  9 contexts,  and  2 sessions).  For  each  check 
item  the  raw  data  consisted  of: 

1)  Preference  judgment; 

2)  Rating  of  the  strength  of  preference; 

3)  Cash  equivalents  for  the  left  and  the  right  gamble 
in  the  check  item  (CL  and  CR) . 

The  following  derived  measures  were  used  in  the  analysis: 

4)  Dollar  difference  (DD) : DD  = CL  - CR; 
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5)  Absolute  dollar  difference  (ADD):  ADD  = abs (DD) : 

6)  Percentage  dollar  difference  (PDD) : 

PDD  = DD* 100/max (CL, CR) . 

In  addition,  the  cash  equivalents  for  riskless  combinations 
of  gasoline  and  ground  beef  assessed  in  training  sessions  I 
and  II  were  used  as  background  information  about  the  subjects' 
substitution  rates  between  gasoline,  beef,  and  cash. 

These  data  were  analyzed  on  increasing  levels  of  detail. 
The  first  analysis  was  concerned  with  the  consistency  of 
responses.  The  second  analysis  compared  the  subjects'  re- 


sponses with  some  simple  strategies  and  models.  Then  the 
overall  effects  of  response  modes,  instructions,  and  items 
were  analyzed  on  a gross  level  to  sort  out  the  main  sources 
of  variation.  On  the  next  level  a fine  analysis  of  the 
context  dependence  of  violations  of  independence  assumptions 
was  performed,  and  finally  the  relationships  between  indi- 
vidual parameters  characterizing  risk  attitude,  substitution 
rates,  etc.  and  independence  violations  was  studied. 

Consistency  Analysis.  Since  each  check  item  was  pre- 
sented twice,  a preliminary  consistency  analysis  could  be 
performed  on  the  changes  of  preferences  that  occurred  between 
the  two  sessions.  The  first  column  of  Table  2 lists  for  each 
subject  the  percentage  of  true  preference  reversals,  that  is 
of  switches  of  preferences  from  the  left  to  the  right  gamble 
in  a check  item  or  vice  versa.  Not  included  are  changes  from 
or  to  indifference.  Considering  the  complexity  of  the  task, 


Table  2 

Intrasubject  Consistency  Data  of  Responses  Between 
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the  consistency  seems  appreciable  with  a median  of  6.7% 
preference  reversals.  Eight  subjects  had  less  than  5%  pref- 
erence reversals.  Only  two  subjects  showed  pronounced 
inconsistencies  (subjects  7 and  17).  Instruction  groups 
(S1-S6,  S7-S12,  S13-S18)  do  not  show  any  marked  differences 
in  consistency.  Accumulated  across  subjects  the  consistency 
statistics  are  as  follows:  78.4%  identical  preferences, 

13.6%  changes  from  or  two  indifference,  and  8.0%  true  pref- 
erence reversals. 

The  remaining  columns  of  Table  2 present  the  intrasub- 
ject consistency  of  cash  equivalent  and  rating  responses,  and 
of  the  derived  measures,  dollar  difference  and  percentage 
dollar  difference.  As  a consistency  measure  the  median  abso- 
lute difference  of  these  measures  between  session  one  and 
two  was  computed.  Again,  the  consistency  seems  considerable. 
Only  three  subjects  have  a median  that  is  higher  than  .75 
(cash  equivalent),  12  (rating),  .85  (dollar  difference),  or 
11%  (percentage  dollar  difference) . Noteworthy  exemptions 
are  subject  16  who  appears  inconsistent  in  all  measures 
except  rating,  and  subject  17  whose  ratings  are  very  incon- 
sistent. 

It  is  quite  possible  that  the  relatively  high  degree  of 
consistency  reflected  in  Table  2 was  a result  of  simplifying 
strategies  such  as  always  preferring  the  gamble  with  the 
higher  sum  of  cell  entries  or  giving  cash  equivalents  equal 
to  the  expected  cash  value  of  the  gambles.  This  possibility 
will  be  examined  next. 
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Strategy  analysis.  To  check  the  possibility  of  simpli- 
fying strategies  the  following  simple  expected  value  model 
was  constructed  to  predict  cash  equivalent  assessments: 

/\ 

CL  = 1/2  (x  + y + v + w) 

where  x,  y,  v,  and  w are  the  cell  entries  of  a gamble,  and 

/\ 

CL  is  the  predicted  cash  equivalent.  Any  subject  who 
determined  his  cash  equivalent  by  some  simple  arithmetic 

would  produce  responses  that  are  highly  correlated  with  CL. 

/\ 

Table  3 gives  the  correlations  between  CL  and  CL  for  each 
subject.  In  general  the  predictive  validity  of  this  simple 
model  is  high  (Median:  .70),  reflecting  once  more  the  sta- 
bility of  simple  additive  models  to  predict  evaluations  that 
are  monotonically  related  to  input  variables.  However,  for 
most  subjects  the  model  validities  lie  below  the  subjects 
reliabilities  as  computed  by  the  retest  correlation  between 
cash  equivalent  responses  in  the  first  and  the  second  session 
(Median:  .79). 

There  are  other  indications  that  subjects  deviated  from 
simple  expected  cash  equivalent  models.  Such  models  assume 
neutral  risk  attitude  (expected  value  assumption) , linear 
substitution  of  commodities  and  of  commodities  with  cash,  and 
additivity.  To  determine  if  subjects  met  these  assumptions, 
the  following  indices  were  computed: 


f*v  %/ 
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Table  3 

Validities  of  a Simple  Expected  Value 
Model  (v)  and  Reliabilities  of 
Cash  Equivalent  Assessments  (r) 


Subject 

V 

(n=  72) 

-I 

r 

(n  = 36) 

1 

Subject 

V 

(n=  72) 

r 

(n  = 36) 

1 

.80 

.85 

■ 

.67 

.78 

2 

.66 

.93 

■ i 

11 

.73 

.64 

3 

.74 

.88 

■ 

12 

.55 

.79 

4 

.61 

.68 

■ 

13 

.65 

.86 

5 

.80 

.88 

1 

14 

.80 

.91 

6 

.93 

.98 

1 

15 

.70 

.78 

7 

.78 

.74 

16 

.36 

.26 

8 

.70 

.59 

■ 

17 

.78 

.74 

9 

.30 

.54 

■ 

18 

.65 

l 

.87 

Median  v:  0.70 
Median  r:  0.79 
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1)  Risk  premium  for  the  gasoline  gamble  (16  0)  : 

RP_  = EV (16  0)  - CE (16  0), 

where  the  left  term  stands  for  the  expected  value 
of  the  gamble  (EV) , and  the  right  term  for  its 
assessed  cash  equivalent  (CE) . 

2)  Risk  premium  for  the  beef  gamble  (10  0) : 

RP_  = EV  ( 10  0)  - CE  (10  0). 

D 

If  the  risk  premium  is  0 the  subject  is  called  risk  neutral, 
if  it  is  larger  than  0 he  is  called  risk  averse,  if  it  is 
smaller  than  0 risk  seeking  (see  Raiffa,  1968) . 

3)  An  indicator  for  the  way  gasoline  and  cash  sub- 
stitute : 

SIG  = $ (16)/$  (8) , 

where  $ (a)  stands  for  the  cash  equivalent  the  sub- 
ject assessed  for  a gallons  of  gasoline. 

4)  An  indicator  for  the  way  beef  and  cash  substitute: 

SIB  = $ (10) /$  (5) , 

where  $ (b)  stands  for  the  cash  equivalent  the  sub- 
ject assessed  for  b pounds  of  ground  beef. 

In  case  of  linear  substitution  these  ratios  would  equal  2, 
they  would  be  smaller  than  2 if  the  commodity  is  substituted 
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with  money  at  a decreasing  rate,  and  greater  than  2 if  it  is 
substituted  at  an  increasing  rate. 

5)  An  additivity  indicator: 

AI  = $ (a)  + $(b)  - $ 

where  the  cash  equivalent  assessed  to  the 

commodity  bundle  containing  a gallons  of  gasoline 
and  b pounds  of  ground  beef;  $ (a)  and  $ (b)  are  the 
respective  cash  equivalents  for  the  single 
commodities . 

This  additivity  index  would  be  zero  if  there  was  no  inter- 
action between  the  commodities,  positive  if  the  combination 

of  commodities  had  a negative  effect,  and  negative  if  the 
combination  of  the  commodities  had  a positive  effect  on 
their  value.  The  additivity  index  used  in  the  further 
analysis  was  actually  an  average  of  eight  indicators  of  the 
above  form  computed  from  assessments  *in  the  first  two 
training  sessions.  All  other  indicators  were  computed  from 
the  unique  assessments  made  in  the  final  training  session. 

Table  4 shows  how  subjects  distribute  over  these  indi- 
cators. Even  with  a rather  liberal  definition  of  neutral 
risk  attitude  (the  risk  premium  had  to  fall  between  -1.00 
and  +1.00)  only  four  subjects  were  risk  neutral  for  the 
gasoline  gamble  and  five  for  the  beef  gamble.  The  remaining 
subjects  distribute  relatively  evenly  into  the  risk  seeking 
and  the  risk  averse  categories.  An  interesting  result  was 
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that  six  subjects  had  different  risk  attitudes  for  gas  and 
beef  according  to  this  classification.  One  subject  was 
strongly  risk  averse  in  the  gasoline  gamble  (RPG  = -.200) 
but  risk  seeking  in  the  beef  gamble  (RPfi  = +2.00).  Only  one 
subject  was  perfectly  risk  neutral  in  both  commodities 
(RP  = 0)  . 

Table  4 

Distribution  of  Subjects  According  to 
Risk  Attitude,  Substitution  Rates, 
and  Non-Additivity 


Risk  Neutral 
■1.00  < RP  < 1.00 


Risk  Averse 
RP  > 1.00 


Risk  Seeking 
RP  < -1.00 


Gas 


Beef 


Linear 
Substitution 
1.75  < SI  < 2.25 


Gas 


Beef 


Decreasing 
Substitution 
SI  < 1.75 

Increasing 
Substitution 
SI  > 2.25 

3 

3 

Additive 
-1.00  < AI  < 1.00 

Negative 

Interaction 

AI  < -1.00 

Positive 

Interaction 

AI  > 1.00 

8 

8 

2 

The  distribution  over  the  substitution  index  (gas  with 
cash)  shows  that  the  majority  of  subjects  had  close  to  linear 
substitution  rates.  They  were  willing  to  exchange  the  first 
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8 gallons  of  gasoline  at  approximately  the  same  cash 
amount  as  an  additional  8 gallons.  In  the  beef  commodity, 
however,  the  majority  of  subjects  showed  a decreasing  rate 
of  substituting  beef  for  cash.  Only  one  subject  had  an 
increasing  rate.  This  means  that  most  subjects  would  pay 
more  for  the  first  5 pound  package  of  beef  than  for  the 
second. 

The  last  box  in  Table  4 shows  that  most  subjects 
were  close  to  additive  in  their  cash  evaluation  of 
commodity  bundles  (8  subjects)  or  displayed  a negative 
interaction  (8  subjects). 

In  summary:  Subjects  showed  pronounced  deviations  from 
the  expected  value  model  and  linear  substitution  of  beef 
with  cash.  Only  a single  subject  falls  into  the  categories 
"risk  neutral",  "linear  substitution",  and  "additive"  in 
all  cases  of  Table  4 (subject  6). 

Overall  Effects.  The  response  mode  effect  was  rather 
limited.  Ratings  of  the  strength  of  preference  among 
gambles  were  correlated  highly  enough  with  dollar  differ- 
ences (Median  r:  .74)  to  conclude  that  they  represented 
similar  judgments  about  the  relative  value  of  the  gambles. 
Only  one  subject  had  essentially  a zero  correlation 
between  ratings  and  dollar  differences.  She  was  indiffer- 
ent between  all  column  check  items,  but  she  assessed 
different  cash  equivalents  for  most  column  pairs.  Only 
two  other  subjects  had  correlations  between  ratings  and 
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dollar  differences  that  were  smaller  than  .66.  The  pre- 
ferred gamble  had  a lower  cash  equivalent  in  merely  1.9% 
of  all  check  items.  Therefore,  in  the  following  analyses 
of  the  effects  of  instructions,  matches,  and  contexts, 
only  preferences,  dollar  differences,  and  absolute  dollars 
differences  are  used  as  dependent  variables. 

The  initial  question  was:  what  was  the  overall  trend 
in  preferences  among  check  items,  and  what  was  the  general 
level  of  model  violations?  Already  on  the  level  of 
ordinal  preferences  a strong  effect  appeared  that  was 
somewhat  unexpected.  Subjects  tended  to  prefer  the  gamble 
that  had  as  one  element  a matched  value.  This  effect 
held  for  most  subjects  independently  of  context  rows  or 
columns.  Table  5 shows  the  distribution  of  preferences 
for  each  match  accumulated  across  subjects,  sessions,  and 
context.  Preferences  were  evenly  distributed  around 
indifference  only  in  the  case  of  the  'natural'  row  match 
(16  0)  ^ (0  16).  In  the  case  of  the  two  other  row 

matches,  preferences  were  heavily  biased  in  the  direction 
of  the  certainty  equivalent  side.  In  the  case  of  the  col- 
umn matches,  preferences  were  biassed  toward  the  matched 
gasoline  value.  Only  one  subject  deviated  markedly  from 
this  general  pattern.  He  always  preferred  the  left 
gamble  of  a column  check  item. 


1 


Table  6 tells  the  same  story  in  terms  of  mean  dollar 
differences.  The  bias  towards  the  gamble  with  the 
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Table  5 

Distribution  of  Preferences  (in  %)  as  a 
Function  of  Matches  (Across  Subjects 
Sessions,  and  Contexts;  n = 324) 


Match 

Pref 

Left 

erred  G< 

No 

amble 

Right 

Total 

(16  0)  ^ (0  16) 

30.2 

43.8 

25.9 

100 

(16  0)  'Mx  x ) 

11.1 

21.6 

67.3 

100 

(10  0)  ^ (x  x ) 

16.0 

16.7 

67.3 

100 

(°») ' (o) 

11.7 

37.0 

51.2 

100 

(aMs) 

16.7 

23.5 

59.9 

100 

Total 

17.2 

28.5 

54.3 

100 

Table  6 

Mean  Dollar  Differences  and  Standard  Errors 
as  a Function  of  Matches  (Across  Subjects, 
Sessions,  and  Contexts;  n = 324) 

(A  minus  sign  means  preference  for  the  right  gamble) 


Match 

Mean 

Standard 

Error 

(16  0)  -v  (0  16) 

0.01 

0.08 

(16  0)  'v  (x  x ) 

-0.78 

0.09 

(10  0)  'v-  (x  x ) 

-0.91 

0.08 

(&) ' (0 

-0.38 

0.12 

(l0.)'  5 

-0.39 

0.08 
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row  instructions  7.4%,  and  under  column  instructions  8.3%. 

For  a more  detailed  analysis  of  the  effects  of  row 
and  column  items  and  of  instructions,  absolute  dollar 
difference  was  used  as  a dependent  variable.  Table  7 
presents  the  individual  means  (in  brackets:  standard 
errors)  of  ADD  for  row  and  column  check  items  in  both 
sessions.  Several  trends  become  obvious  when  inspecting 
this  table.  First,  the  general  level  of  independence 
violations  is  not  very  high  (Grand  mean  across  subjects, 
sessions,  and  items:  $1.08;  standard  deviation:  1.42). 
Second,  there  are  notable  intersubject  differences  in  the 
degree  and  the  relative  size  of  the  violations  of  row  and 
column  independence.  Finally,  the  variance  of  ADD  for 
row  and  column  check  items  seemed  to  differ  frequently, 
with  a tendency  of  the  row  variance  to  be  larger  than  the 
column  variance  (exceptions  are  subjects  14  and  16). 

An  F-test  showed  that  row  and  column  variances  differed 
significantly  for  9 subjects  (p  < .05). 

Because  of  the  intersubject  variability  and  the 
different  variances  of  row  and  column  items  the  session 
effect  was  tested  separately  for  each  subject  and  for 
row  and  column  responses  with  a t-test  for  paired  samples. 
Of  the  36  t-tests  5 showed  a significant  difference 
between  mean  ADDS  for  column  items,  two  for  row  items 
(p  < .05).  No  systematic  pattern  of  instruction  effects 
could  be  identified  in  these  differences.  In  the  group 


for  Row  and  Column  Check  Items  in  Each  Session 


(In  brackets:  Standard  Errors;  n=18) 


Subject 

Session 

1 

2 

X 

t 

P 

1 

Row 

Col 

1.06 

0.44 

(.17) 

(.14) 

1.02 

0.33 

(.14) 

(.16) 

1.04 

0.39 

(.11) 

(.10) 

4.32 

.00 

2 

Row 

Col 

1.83 

0.81 

(.34) 

(.14) 

2.00 

0.88 

(.34) 

(.11) 

1.91 

0.84 

(.24) 

(.09) 

4.25 

.00 

3 

Row 

Col 

1.28 

1.11 

(.26) 

(.22) 

0.94 

0.89 

(.17) 

(.23) 

1.11 

1.00 

(.16) 

(.10) 

0.50 

.61 

4 

Row 

Col 

1.33 

0.33 

(.29) 

(.05) 

0.82 

0.25 

(.20) 

(.05) 

1.08 

0.29 

(.17) 

(.02) 

4.62 

.00 

5 

Row 

Col 

0.75 

0.61 

(.19) 

(.14) 

0.61 

0.28 

(.13) 

(.06) 

0.68 

0.45 

(.12) 

(.08) 

1.60 

.10 

6 

Row 

Col 

0.00 

0.00 

(.00) 

(.00) 

0.00 

0.00 

(.00) 

(.00) 

0.00 

0.00 

(.00) 

(.00) 

0.00 

.50 

7 

Row 

Col 

1.39 

1.33 

(.32) 

(.29) 

1.28 

0.75 

(.28) 

(.09) 

1.34 

1.05 

(.21) 

(.16) 

1.10 

.27 

8 

Row 

Col 

1.73 

2.13 

(.44) 

(.33) 

1.69 

1.31 

(.44) 

(.28) 

1.71 

1.72 

(.31) 

(.23) 

-0.03 

.97 

9 

Row 

Col 

1.36 

1.05 

(.27) 

(.19) 

0.75 

0.64 

(.16) 

(.07) 

1.05 

0.85 

(.16) 

(.06) 

1.17 

.30 

10 

Row 

Col 

0.75 

0.89 

(.16) 

(.18) 

0.63 

0.51 

(.16) 

(.12) 

0.69 

0.67 

(.09) 

(.11) 

0.14 

.92 

11 

Row 

Col 

0.67 

0.56 

(.17) 

(.16) 

1.11 

0.75 

(.26) 

(.15) 

0.89 

0.65 

(.16) 

(.11) 

1.24 

.22 

y 
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Table  7 (continued) 


Subject 

Session 

1 

2 

X 

t 

12 

Row 

Col 

0.78 

1.06 

(.17) 

(.06) 

1.06 

1.00 

(.25) 

(.00) 

0.92 

1.03 

( .15) 
(.03) 

-0.92 

13 

Row 

Col 

1.40 

0.09 

(.29) 

(.06) 

1.29 

0.00 

(.23) 

(.00) 

1.35 

0.05 

(.18) 

(.03) 

7.12 

14 

Row 

Col 

0.89 

2.75 

(.16) 

(.20) 

0.47 

2.35 

(.12) 

(.20) 

0.68 

2.55 

( .11) 
(.11) 

-12.02 

15 

Row 

Col 

0.79 

0.11 

(.23) 

(.04) 

0.96 

0.37 

(.20) 

(.04) 

0.88 

0.23 

(.15) 

(.04) 

4.19 

16 

Row 

Col 

2.78 

4.83 

(.50) 

(.79) 

2.50 

5.67 

(.47) 

(.93) 

2.64 

5.25 

(.34) 

(.62) 

-3.69 

17 

Row 

Col 

1.21 

1.04 

(.17) 

(.23) 

1.87 
1.  39 

(.27) 

(.16) 

1.54 

1.21 

(.17) 

(.11) 

1.63 

18 

Row 

Col 

0.95 

0.40 

(.26) 

(.12) 

0.53 

0.26 

(.26) 

(.13) 

0.74 

0.34 

(.18) 

(.07) 

2.07 

Total  row  : 1.13  (.05) 


.47 

.00 

.00 

.00 

.00 

.14 

.05 


Total  col  : 1.03  (.06) 


of  subjects  that  did  not  receive  any  special  instructions, 
one  subject  significantly  decreased  his  column  violations 
in  the  second  session.  In  the  row  instruction  group  two 
subjects  decreased  their  column  violations  significantly, 
one  decreased  his  row  violations  significantly.  In  the 
column  instruction  group  one  subject  increased  his  column 
violations,  one  subject  decreased  his  column  violations, 
and  one  subject  increased  his  row  violations. 

To  test  the  difference  between  mean  ADDS  of  row  and 
column  check  items,  ADD  was  averaged  across  sessions. 

The  average  ADD  for  row  and  column  check  item  are  listed 
in  the  fourth  column  of  Table  7 together  with  their 
standard  errors.  The  last  two  columns  give  the  t values 
and  the  significance  level.  For  13  subjects  row  indepen- 
dence was  more  strongly  violated  than  column  independence. 
Six  of  these  differences  are  significant  at  the  .05  level 
or  lower.  Only  for  two  subjects  (S14  and  S16)  column 
violations  were  significantly  higher  than  row  violations. 
Notable  is  the  lack  of  any  violations  for  S6 . Across 
subjects  the  difference  between  row  and  column  violations 
is  insignificant. 

In  summary:  the  analysis  of  overall  effects  found 
no  substantial  response  mode  effects,  strong  intersubject 
variability  in  the  degree  and  relative  size  of  row  and 
column  independence  violations,  no  systematic  instruction 


effects,  and  only  a modest  effect  of  row  vs.  column 


58 


violations  that  differed  among  subjects.  To  understand 
more  deeply  the  nature  of  row  and  column  violations,  it 
seems  therefore  necessary  to  look  into  the  effects  of 
specific  context  rows  or  columns  on  responses,  and  to 
establish  the  specific  pattern  of  the  row  and  column 
violations  as  a function  of  these  contexts.  This  analysis 
will  be  done  next. 

Context  analysis.  Tables  8-12  present  the  most 
interesting  results  of  the  experiment.  They  show  the 
preference  distributions  and  mean  dollar  differences 
(together  with  their  standard  errors)  for  each  of  the  five 
matches  as  a function  of  context  row  or  column.  The  data 
are  accumulated  across  subjects  and  sessions. 

First  to  the  'natural'  row  match  (16  0)  ^ (0  16). 

As  one  would  expect  subjects  were  predominantly  indifferent 
in  all  those  check  items  that  were  generated  by  constant 
context  rows  (the  first  three  contexts  in  Table  8) . Only 
two  responses  were  not  indifferent  (for  the  (10  10) 

context)  presumably  because  the  subjects  made  an  error. 

The  block  of  the  next  three  context  rows  represents 
those  check  items  in  which  the  left  gamble  had  more 
balanced  outcomes  than  the  right.  That  is  in  the  left 
gamble  the  subjects  had  a chance  of  winning  some  modest 
amount  in  either  event,  while  in  the  right  gamble  they 
had  a chance  at  winning  much  vs.  nothing.  A large  marjor- 
ity  of  the  subjects  preferred  the  left  gamble,  following 
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Table  8 


Preferences  and  Mean  Dollar  Differences 
as  a Function  of  Context  Rows 
(Match:  (16  0)  ^ (0  16);  n = 36) 


p 

Context  Row 



m 

>refere 

No 

— 

nces 

Right 

Mean 

* 

DD 

Standard 

Error 

■ 

(0 

0) 

0 

100 

0 

0.00 

0.00 

(5 

5) 

0 

100 

0 

0.00 

0.00 

(10 

10) 

0 

94 

6 

-0.06 

0.06 

(0 

5) 

83 

14 

3 

0.75 

0.16 

(5 

10) 

59 

22 

19 

0.90 

0.23 

(0 

10) 

81 

8 

11 

1.23 

0.31 

(5 

0) 

10 

18 

72 

-0.57 

0.15 

(10 

5) 

25 

22 

53 

-0.83 

0.34 

(10 

0) 

3 

19 

78 

-1.36 

0.22 

i 

; ■ 
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A general  trend  towards  "multivariate  risk  aversion",  as 
Richard  (1975)  called  it.  This  effect  is  least  pronounced 
for  the  (5  10)  context  row,  and  strong  for  the  (0  5) 

and  (0  10)  rows.  The  mean  dollar  differences  reflect 

that  trend  in  dollar  terms.  A positive  dollar  difference 
here  means  a higher  cash  equivalent  for  the  left  gamble. 

The  last  three  rows  in  Table  8 show  the  results  for  the 
inverted  context  rows,  which  created  identical  check  items 
but  in  reversed  order.  The  results  reflect  that  symmetry. 

To  determine  to  which  degree  the  pattern  of  violations 
in  Table  8 reflects  individual  responses,  the  number  of 
preferences  that  agreed  with  the  multivariate  risk  aversion 
pattern  was  counted  for  each  subject.  Multivariate  risk 
aversion  would  predict  that  a subject  responds  with 
indifference  in  the  first  three  row  contexts,  with  a 
preference  for  the  left  gamble  in  the  second  three  row 
contexts,  and  with  a preference  for  the  right  gamble  in 
the  last  three  row  contexts.  Three  subjects  fitted  this 
pattern  perfectly  in  all  of  their  18  responses.  Eight 
subjects  had  only  one  or  two  deviations  from  this  pattern, 
five  subjects  had  three  or  four  deviations,  and  only  two 
subjects  (including  the  indifferent  subject  6)  showed 
more  than  four  violations. 

A more  refined  pattern  analysis  was  based  on  the 


individual  dollar  differences.  For  each  subject  the 
average  dollar  difference  was  computed  for  each  of  the 
three  groups  of  context  rows  in  Table  8.  If  a subject's 
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cash  equivalence  responses  would  agree  with  the  general 
multivariate  risk  aversion  pattern,  his  ranking  of  these 
three  mean  dollar  differences  would  be  as  follows: 
lowest  (negative)  for  the  last  three  contexts,  medium 
(close  to  zero)  for  the  first  three  contexts,  and  largest 
(positive)  for  the  middle  three.  For  15  subjects  the  mean 
dollar  differences  were  ranked  in  exactly  this  way.  Two 
subjects  had  reversed  rankings,  and  subject  6 had  zero 
dollar  differences  for  all  three  context  groups. 

Tables  9 and  10  present  the  results  for  the  row 
matches  (16  0)  ^ (x  x)  and  (10  0)  ^ (x  x)  . The  pattern 

of  preferences  is  not  quite  as  clear  here,  since  the 
bias  for  the  gamble  with  the  matched  value  x starts  to 
dominate  the  picture.  But  while  the  bias  is  clearly  the 
main  feature  for  the  first  and  last  three  context  rows, 
the  middle  three  context  rows  counteract  the  bias.  For 
these  context  rows  the  left  gamble  in  the  check  item 
becomes  again  more  balanced.  Correspondingly  subjects 
tended  to  increase  their  preferences  for  this  gamble. 

To  determine  how  well  individual  response  patterns 
agreed  with  the  general  patterns  reflected  in  Tables  9 and  10, 
individual  average  dollar  difference  were  computed  for 
each  of  the  three  context  row  groups  in  the  two  tables. 

If  there  was  a multivariate  risk  aversion  effect  counter- 
acting the  bias  effect,  one  would  expect  that  these  mean 
dollar  differences  were  ranked  in  the  following  order: 
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Table  9 


Preferences  and  Mean  Dollar  Differences 
as  a Function  of  Context  Rows 
(Match:  (16  0)  ^ (x  x) ; n = 36) 
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Table  10 


Preferences  and  Mean  Dollar  Differences 
as  a Function  of  Context  Rows 
(Match:  (10  0)  “Mx  x)  ; n = 36) 
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lowest  for  the  last  three  context  rows,  middle  for  the 
first  three  context  rows,  and  highest  for  the  middle 
three  context  rows.  This  ranking  would  be  expected, 
since  the  last  three  context  rows  make  the  left  gamble 
more  unbalanced  than  the  first  three  rows,  while  the 
middle  three  context  rows  make  the  left  gamble  more 
balanced.  For  the  (16  0)  •v  (x  x)  match  mean  dollar 

differences  were  ranked  this  way  for  four  subjects,  11  sub- 
jects showed  only  one  reversal,  one  subject  had  two 
reversals,  and  one  had  three.  For  the  (10  0)  % (x  x) 

match  mean  dollar  differences  were  ranked  according  to 
the  multivariate  risk  aversion  pattern  for  seven  subjects, 
eight  subjects  had  one  reversal,  and  two  subjects  had  two. 
Subject  6 was  again  indifferent  in  all  responses  and 
consequently  had  zero  dollar  differences  in  all  cases. 

Although  the  trend  for  multivariate  risk  aversion 
is  by  no  means  as  pronounced  for  these  last  two  matches 
as  it  was  for  the  natural  row  match  (16  0)  ^ (0  16),  it 
seems  to  be  stable  and  persistent  within  subjects.  The 
data  then  suggest  that  row  indpendence  in  its  pure  form 
is  also  violated  in  a systematic  and  context  dependent 
way. 

What  about  violations  of  row  independence  in  its 
utility  independence  form?  Here  the  first  three  context 
rows  of  Tables  9 and  10  are  relevant.  Although  the 
preference  distributions  show  no  clear  trend,  the  mean 
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dollar  difference  increases  as  the  constant  row  context 
increase.  Figure  4 presents  this  trend  graphically 
for  both  matches.  Only  the  maximum  difference  for  the 
beef  trend  is  significant. 

In  the  check  items  that  tested  column  independence 
the  picture  is  dominated  by  the  bias  to  prefer  the 
gamble  with  the  matched  gasoline  amount  (see  Tables  11 
and  12).  For  neither  match  a clear  context  effect  is 
observable.  To  determine  how  well  individual  response 
patterns  agreed  with  the  general  pattern  reflected  in 
Tables  11  and  12,  individual  average  dollar  difference 
were  computed  for  each  of  the  three  context  column 
groups  for  each  subject.  If  there  was  any  effect  of  the 
grouping  of  context  columns,  similar  to  the  multivariate 
risk  aversion  effect,  one  would  expect  some  character- 
istic ranking  of  these  mean  dollar  differences.  In 
fact,  no  such  typical  ranking  could  be  found.  Subjects 
distributed  evenly  over  all  possible  rankings  in  both 
matches.  Thus  it  seems  that  violations  of  column 
independence  are  largely  due  to  the  preference  bias 
and  not  context  dependent. 

Individual  Parameters  and  Independence  Violations. 
These  last  few  result  paragraphs  will  discuss  some 
individual  characteristics  as  related  to  violations 
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Table  11 

Preferences  and  Mean  Dollar  Differences 
as  a Function  of  Context  Columns 
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of  independence  assumptions.  A correlation  analysis 
related  the  previously  described  individual  parameters  (risk 
premium  for  gasoline  and  beef  gambles,  substitution  indices 
for  gasoline  and  beef  with  cash,  and  additivity  indices) 
with  indices  of  independence  violation.  These  indices  are 
the  individual  averages  in  absolute  dollar  differences  for 
row  and  column  check  items  (see  Table  7 that  lists  these 
indices)  and  an  index  of  'multivariate  risk  aversion' . This 
index  was  constructed  as  follows: 


MRA  = 


fie 

°1 

T16  °1 

lo 

loj 

$ Lio  oj 

where  $|^  J stands  for  the  assessed  cash  equivalent.  For 
each  subject  the  index  was  an  average  of  assessments  of  four 
presentations . 

A positive  MRA  index  means  that  subjects  tended  to 
prefer  the  more  balanced  gamble  on  the  left,  a negative 
index  means  that  they  preferred  the  more  imbalanced  gamble 
on  the  right,  and  a zero  index  means  indifference.  Table 
13  presents  the  correlations  of  the  5 independent  indices 
with  the  individual  mean  absolute  dollar  differences  for 
row  and  column  items  and  the  MRA  index.  Since  the  number 
of  subjects  was  small,  the  results  have  to  be  interpreted 
with  caution. 
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The  additivity  index  is  uncorrelated  with  all  three 
indices  of  independence  violations.  From  non-additivity  of 
commodities  one  can  therefore  not  draw  any  conclusions 
about  possible  violations  of  row  or  column  independence. 

The  risk  premium  of  the  gasoline  gamble  (16  0)  is  cor- 

related positively  with  the  MRA  index.  This  indicates  that 
subjects  who  are  risk  averse  in  a single  commodity  are  also 
somewhat  multivariate  risk  averse.  However,  the  risk 
premium  of  the  beef  gamble  (10  0)  is  slightly  negatively 

correlated  with  the  MRA  index,  casting  doubt  on  the 
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generality  of  the  conclusion  that  single  attribute  risk 
aversion  and  multivariate  risk  aversion  go  hand  in  hand. 

Both  risk  premiums  were  negatively  correlated  with  the 
mean  absolute  dollar  differences  of  row  and  column  check 
items.  Subjects  who  were  more  risk  seeking  tended  to  violate 
independence  assumptions  more.  Furthermore,  all  substitution 
indices  are  positively  correlated  with  the  indices  of  inde- 
pendence violations.  Subjects  who  had  a more  pronounced 
decreasing  substitution  rate  between  the  commodities  and 
money  tended  to  violate  row  and  column  independence  less. 

Much  of  the  previous  analysis  has  dealt  with  general 
trends  across  subjects.  In  the  last  few  pages,  some  sub- 
jects will  be  singled  out  who  deviated  from  such  general 
trends,  and  their  responses  will  be  discussed  in  more  detail. 
Subject  6 showed  a very  clear  response  pattern.  He  was 
indifferent  in  all  90  check  items.  He  was  also  the  one 
subject  who  was  risk  neutral  in  both  commodities,  had  almost 
linear  substitution  rates  for  gas  and  beef  with  cash,  and 
was  close  to  additive  in  his  cash  evaluations  of  commodity 
bundles.  Naturally  he  had  the  highest  validity  of  the 
simple  EV  model  described  on  page  45.  it  is  clear  that 
subject  6 used  a simple  pricing  and  expected  value  strategy 
to  determine  his  cash  equivalents.  As  he  pointed  out  in 
the  interview  following  the  experiment,  he  had  recognized 
the  construction  principle  of  the  check  items  and  had  re- 
sponded with  indifference  statements.  It  was  his  opinion 
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that  his  responses  reflected  his  true  preferences,  since 
the  stakes  were  not  large  enough.  When  he  was  presented  in 
a later  session  with  check  items  in  which  all  amour.  ~ were 
multiplied  by  10,  he  showed  exactly  the  response  pattern 
that  most  subjects  displayed  in  the  actual  experiment.  In 
particular,  he  became  risk  averse,  showed  a quite  strong 
decreasing  substitution  rate  of  beef  with  cash,  and  became 
multivariate  risk  averse. 

Subject  16  on  the  other  hand  had  the  highest  violations 
of  row  and  column  independence.  The  most  outstanding 
characteristic  of  his  responses  was  their  extremeness. 

His  cash  equivalents  for  commodity  bundles  were  much  higher 
than  the  cash  equivalents  of  other  subjects,  overshooting 
even  the  market  price  substantially.  He  was  very  strongly 
risk  seeking,  and  he  even  gave  a certainty  equivalent  of  12 
to  the  beef  gamble  (10  0) , a response  that  violated  the 

expected  utility  principle.  He  insisted  on  these  extreme 
responses,  however,  with  the  explanation  that  he  liked  to 
gamble  and  that  he  did  not  care  for  the  actual  outcomes  that 
much.  Besides  the  extremeness  of  his  responses,  subject  16 
was  also  the  most  inconsistent  subject.  His  retest  relia- 
bility for  cash  equivalents  was  only  .26.  Thus  the  high 
violation  of  row  and  column  independence  for  this  subject 
may  be  explained  by  his  erratic  and  extreme  responses. 

Subject  14  showed  the  second  highest  violations  of 
column  independence.  He  was  the  only  subject  who  often 
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responded  with  indifference  to  a check  item,  but  gave  dif- 
ferent cash  equivalents.  In  fact,  he  was  indifferent  among 
all  but  two  column  check  items,  thus  fitting  column  inde- 
pendence very  well  in  terms  of  preferences.  But  his 
strategy  to  determine  independently  his  cash  equivalents  for 
both  gambles  in  the  check  item  did  not  reflect  this  indif- 
ference. The  relatively  high  validity  of  the  simple  EV 
model  (.80)  indicates  that  he  used  some  pricing  strategy  to 
set  cash  equivalents. 

« 

Subject  13  showed  a very  low  degree  of  column  indepen- 
dence violation.  He  also  had  linear  substitution  rates  of 
commodities  with  money  and  was  nearly  additive  in  cash 
equivalent  assessments  for  commodity  bundles.  His  low 
degree  of  column  violation  may  have  been  a consequence  of 
this  simple  linear  additive  strategy  in  commodity  bundle 
evaluations.  His  risk  attitudes  were  not  very  strong  either. 
But  unlike  subject  6 who  showed  a similar  pattern  of  indi- 
vidual parameters,  his  multivariate  risk  aversion  was  very 
strong,  leading  to  his  violations  of  row  independence. 

All  other  subjects  had  response  patterns  that  corre- 
sponded roughly  to  the  general  trends  outlined  in  the  previous 


sections. 


DISCUSSION 


Before  making  an  attempt  to  explain  the  results  of  the 
experiment  and  to  discuss  their  implications  for  descriptive 
and  normative  modelling  of  risky  multiattribute  preferences, 
the  question  of  the  interpretability  and  generalizability  of 
the  experimental  data  will  be  addressed.  Data  from  choice 
experiments  are  susceptible  to  two  kinds  of  problems: 
internal  response  inconsistencies  and  simplifying  response 
strategies.  If  subjects  respond  very  inconsistently,  the 
experimental  results  simply  become  messy  and  possible  experi- 
mental effects  are  buried  under  the  response  errors.  If 
subjects  use  simplifying  strategies,  they  may  be  very  con- 
sistent, but  their  responses  may  not  reflect  their  true 
preferences.  In  either  case  the  interpretation  of  the 
results  is  seriously  hampered. 

Fortunately,  the  experimental  results  show  that  neither 
randomness  nor  simplifying  strategies  dominated  the  response 
patterns.  Except  for  one  subject,  all  subjects  responded 
with  a high  degree  of  internal  consistency.  This  consistency 
was  probably  due  to  the  simple  construction  principle  of  the 
check  items.  Although  gambles  of  the  form  used  in  the 
experiment  may  be  difficult  to  compare  in  general , the 
specific  gambles  that  constituted  the  check  items  had  some 
prominent  features  that  allowed  an  easier  comparison  on  the 
basis  of  straightforward  preference  characteristics. 
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For  example,  subjects  could  rather  easily  decide  among 
pairs  of  gambles  that  tested  the  marginality  assumption,  if 
they  were  multivariate  risk  averse,  and  as  long  as  that 
preference  characteristic  did  not  change,  their  responses 
would  not  change  in  repeated  comparisons.  As  subjects 
mentioned  in  the  post-experimental  interviews,  they  found 
the  preference  tasks  simple,  mainly  because  of  the  ease  with 
which  they  could  establish  some  basic  preference  character- 
istics in  their  choices.  However,  most  subjects  thought 
that  the  cash  equivalent  assessments  for  gambles  was  diffi- 
cult. Why  then  the  relatively  high  consistency  of  cash 
equivalent  assessments? 

It  is  possible  that  subjects  used  some  simple  arith- 
metic to  generate  cash  equivalents.  But  in  the  discussion 
sessions  in  the  training  period  it  became  clear  that,  except 
for  subject  6,  all  subjects  thought  that  averaging,  pricing, 
or  expected  value  strategies  would  misrepresent  their  true 
preferences.  Most  subjects  described  their  cash  equivalent 
assessment  strategies  as  an  'averaging  and  discounting' 
strategy.  In  assessing  certainty  equivalents  they  would 
first  think  about  the  expected  value,  realize  a preference, 
and  adjust  the  cash  equivalent  upward  or  downward  according 
to  the  direction  of  their  initial  preference.  In  assessing 
cash  equivalents  for  commodity  bundles  they  would  first  use 
some  market  price  equivalent  and  then  discount  for  the 
marginal  utility  of  the  commodities.  The  results  reflect 
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these  strategies  quite  well,  in  particular  in  the  risk 
attitudes  of  most  subjects  and  in  the  non-linear  substitu- 
tion rates  of  commodities  with  cash. 

The  manifestation  of  clear  preference  characteristics 
such  as  risk  aversion  (or  risk  seeking)  and  non-linear 
substitution  rates  not  only  leads  to  the  argument  that  sub- 
jects probably  did  not  use  simple  response  strategies.  It 
also  says  something  about  the  generalizability  of  the 
results  beyond  the  simple  stimulus  that  was  used  in  the 
experiment.  If  subjects  had  used  strategies  that  would  only 
be  applicable  to  stimuli  that  allow  linear  trade-offs  among 
commodities,  and  between  commodities  and  cash,  and  if  the 
outcomes  had  been  so  inconsequential  as  to  suggest  expected 
value  computation,  the  generalizability  would  be  very 
limited.  But  since  subjects  did  in  fact  exhibit  quite 
strong  preference  characteristics,  there  is  reason  to 
assume  that  the  results  generalize  to  such  stimuli  that 
create  similar  preference  characteristics,  that  is  all  those 
for  which  risk  attitude,  non-linear  substitution  rates 
and  non-additivity  are  of  a similar  form  as  for  the  gambles 
for  beef  and  gasoline. 

A first  look  at  the  general  pattern  of  the  responses  to 
check  items  revealed  a strong  bias  to  prefer  the  gamble  with 
a matched  value.  This  bias  was  independent  of  context,  only 
to  some  extend  dependent  on  matches,  and  held  for  all  sub- 
jects similarly.  One  possible  explanation  is  that  subjects 
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mismatched,  that  is  the  matched  values  were  different  from 
their  true  indifference  point.  An  explanation  of  such  mis- 
matching would  be  an  'anchoring  and  adjustment'  process. 
Tversky  and  Kahnemann  (1975)  describe  such  processes  in 


judgments  about  uncertainty,  and  they  hypothesize  that 
| several  biasses  in  such  judgments  can  be  explained  by  insuf-  ] 

ficient  adjustment.  In  the  present  experiment  subjects  may 
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have  focussed  or  anchored  on  the  expected  value  of  a gamble 
or  some  price  equivalent  of  a commodity  bundle,  and  adjusted 
that  value  up  or  down  according  to  their  risk  attitude  and 

marginal  utilities  of  commodities.  In  fact,  many  subjects 
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described  their  matching  process  in  somewhat  similar  terms. 

Insufficient  adjustment  in  such  an  'anchoring  and 
adjustment'  process  can  explain  some  of  the  biasses  that  were 
observed  in  the  experiment.  Since  subjects  had  to  match  a 
■ gasoline  amount  against  a beef  amount  in  the  column  matches, 

they  may  have  started  with  a gasoline  amount  that  was 
approximately  matching  the  beef  amount  in  price.  Then  they 
adjusted  the  gasoline  amount  downward  to  account  for  the 
lower  subjective  value  of  beef.  If  such  adjustment  was 

. . I 

insufficiently  large,  they  would  prefer  the  gamble  with  the 
j gasoline  match  in  the  check  items.  In  the  row  matches,  the 

insufficient  adjustment  away  from  expected  values  can  explain 
! the  bias  for  risk  averse  subjects.  Risk  averse  subjects 


would  adjust  the  certainty  equivalent  down  from  the  expected 
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they  would  prefer  the  gamble  with  the  certainty  equivalent 
in  the  check  items.  However,  insufficient  adjustment  would 
predict  the  opposite  effect  for  risk  seeking  subjects.  Risk 
seeking  subjects  would  adjust  their  certainty  equivalent  not 
far  enough  upwards,  and  consequently  tend  to  prefer  the  side 
with  the  gamble  over  the  side  with  the  certainty  equivalent 
in  the  check  items.  Thus  anchoring  and  insufficient  adjust- 
ment cannot  completely  explain  the  bias. 

Another  possible  explanation  for  the  bias  is  offered  by 
Slovic's  (1975)  More  Important  Dimension  hypothesis  (MID 
hypothesis) . According  to  this  hypothesis,  subjects  resolve 
conflicts  about  matched  pairs  by  preferring  the  side  which 
is  better  on  the  more  important  attribute.  This  would 
explain  the  bias  in  the  column  check  items,  since  the  gam- 
ble with  the  matched  value  had  more  gasoline,  a commodity 
that  was  considered  the  more  useful  one  by  all  subjects. 

The  bias  for  the  gamble  with  the  certainty  equivalent  in 
the  row  check  items  may  be  similarly  explained  as  a tendency 
to  prefer  the  side  that  offered  more  certainty. 

So  far  the  experiment  does  not  give  any  conclusive 
explanations  for  the  reasons  for  the  strong  bias  found,  but 
it  raises  the  important  question:  how  well  do  matches  hold 
up  in  actual  choices  when  put  into  different  context? 

Simple  matching  may  involve  processes  that  are  different 
from  the  judgmental  processes  involved  in  the  comparison  of 
complex  gambles.  When  confronted  with  their  bias  in  the 
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interview  after  the  experiment,  several  subjects  explained 
it  with  mismatching.  They  indicated  that  they  would  resolve 
the  inconsistency  by  changing  their  match.  Other  subjects 
pointed  out  that  they  compared  the  complex  gambles  element 
by  element,  which  lead  them  to  different  conclusions  than 
the  comparisons  of  total  rows  and  columns  required  in  the 
matches.  They  were  puzzled  by  the  bias  and  they  were  not 
quickly  ready  to  resolve  it  either  way.  If  forced,  however, 
they  tended  to  be  more  suspicious  of  their  comparisons  be- 
tween check  item  gambles  than  of  their  matches. 

Naturally,  the  bias  overshadowed  some  experimental 
effects.  Nevertheless,  the  more  interesting  results  of  the 
experiment  held  up  against  the  preference  bias.  First  of 
all,  violations  of  row  and  column  independence,  although 
neither  very  large  (in  the  area  of  15%  dollar  difference 
relative  to  the  higher  cash  equivalent),  were  patterned  very 
clearly  for  all  subjects.  The  first  experimental  question 
was:  is  there  any  difference  in  the  leve 1 of  violations  of 

row  and  column  independence?  This  had  to  be  negated  on  t he 
basis  of  the  experimental  data.  But  the  form  of  indepen- 
dence violations  implied  that  row  independence  was  systemati- 
cally violated  overriding  the  bias,  while  column  independence 
violations  were  largely  due  to  the  bias  effect.  Particularly 
interesting  is  the  form  of  violations  of  row  independence, 
which  suggests  that  most  subjects  were  multivariate  risk 
averse.  The  term  multivariate  risk  aversion  had  been 
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introduced  by  Richard  (1975)  in  analogue  to  the  usual  (single 
attribute)  definition  of  risk  aversion.  It  characterizes  a 
behavior  that  shows  preferences  among  lotteries  with  equal 
marginal  distributions.  Multivariate  risk  aversion  leads 
to  a preference  of  the  gamble  that  has  more  balanced  out- 
comes . 

The  explanation  that  subjects  gave  for  their  multi- 
variate risk  aversion  is  persuasive:  they  preferred  the 
gamble  in  which  they  would  win  something  in  both  events,  over 
the  gamble  in  which  they  would  win  a lot  in  one  event,  and 
nothing  in  the  other.  Even  when  the  identical  marginal  dis- 
tributions of  the  check  items  in  the  marginal ity  test  were 
pointed  out  to  them,  they  did  not  change  their  preferences. 
Such  multivariate  risk  aversion  could  be  established  as  a 
persistent  and  intended  preference  characteristic  for  all 
but  one  subject. 

Utility  independence  as  tested  in  the  experiment  could 
not  be  rejected  conclusively.  The  trend  that  showed  up  in 
the  experiment  was  a peculiar  strengthening  of  preferences 
of  the  certainty  equivalent,  if  the  context  rows  were 
increased.  This  means  that  subjects  tended  to  prefer  the 
certain  side  over  the  gamble,  if  they  received  a bonus  in 
addition  to  the  gamble  and  the  certainty  equivalent.  One 
could  make  an  argument  that  the  reverse  effect  should  happen. 
A bonus  in  form  of  some  certain  commodity  amount  may  decrease 
the  risk  aversion  of  a subject  (or  increase  his  risk  seeking 
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attitude) , thus  shifting  his  preference  to  the  side  of  the 
gamble  against  which  his  certainty  equivalent  was  matched, 
v.  Winterfeldt  and  Fischer  (1975)  present  an  example  where 
such  a shift  in  preference  appears  reasonable.  The  opposite 
effect  that  occurred  in  the  experiment  may  have  been  due  to 
the  increased  perception  of  a large  sure  win  on  the  side 
that  included  the  certainty  equivalent. 

These  types  of  violations  of  row  and  column  independence 
in  form  of  biasses  and  multivariate  risk  aversion  were  very 
stable  and  unaffected  by  instructions  or  response  modes. 
Preferences,  cash  equivalents,  and  ratings  showed  basically 
the  same  response  patterns. 

The  relationships  between  preference  characteristics, 
such  as  single  attribute  risk  attitude  and  substitution 
rates  between  commodities  and  cash,  on  one  hand  and  viola- 


tions of  independence  assumptions  on  the  other,  remain  less 
clear.  No  relationship  between  (riskless)  additivity  of 
commodities--expressed  in  terms  of  additivity  of  the  cash 
equivalents — and  row  or  column  independence  violations 
could  be  observed.  Delbeke  and  Fauville  (1974)  had  hypothe- 
sized that  row  independence  in  its  marginality  form  would 
be  violated  more  strongly  if  attributes  were  non-additive 
in  the  riskless  sense.  Their  experimental  data  could  not 
support  this  hypothesis,  and  neither  does  this  experiment. 

In  fact,  the  present  results  suggest  that  the  reason  for 
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violations  of  marginality  (risky  additivity)  are  quite 
independent  from  the  reasons  for  violating  riskless  additi- 
vity. Violations  of  row  independence  in  its  marginality 
form  were  due  to  multivariate  risk  aversion  and  not  to  an 


interaction  or  complementarity  of  the  commodities. 

There  are,  of  course,  commodities  or  attributes  for 
which  such  relationships  may  exist.  An  example  are  com- 
plementary commodities  such  as  shoes.  Presumably  subjects 
would  prefer  a fifty-fifty  gamble  in  which  they  can  win  a 
pair  of  shoes  vs.  nothing  over  a fifty-fifty  gamble  in 
which  they  can  win  the  right  vs.  the  left  shoe.  Complemen- 
tarity of  commodities  leads  then  to  a multivariate  risk 
seeking  behavior.  In  general,  multivariate  risk  attitude 
will  be  compounded  with  riskless  non-additivity.  But  as  the 
experiment  indicates,  multivariate  risk  aversion  can  exist 
as  a pure  phenomenon,  not  influenced  by  riskless  interaction 
phenomena. 

Similar  to  the  relationship  between  riskless  and  risky 
additivity,  one  can  explore  the  relationships  between 
utility  functions  derived  from  riskless  trade-offs  and 
utility  functions  derived  from  lotteries.  In  the  experi- 


ment this  comparison  could  be  done  by  looking  at  the 
relationship  between  substitution  rates  and  risk  premia. 
Both  are  indicators  of  the  non-linearity  of  the  utility 
functions.  Substitution  rates  indicate  how  marginally 
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premia  indicate  how  convex  the  risky  utility  function  is. 

The  experimental  analysis  could  identify  no  relationship 
between  these  two  indicators.  Subjects  who  had  almost 
linear  substitution  of  gasoline  with  cash,  varied  widely  in 
risk  attitude  about  gasoline  gambles.  Similarly,  subjects 
who  all  had  rather  pronounced  nonlinear  substitution  rates 
of  beef  with  cash  were  risk  neutral,  risk  seeking,  or  risk 
averse . 

The  results  of  one  pilot  subject  demonstrate  that  the 
concepts  of  risky  and  riskless  utility  function  have  to  be 
separated  quite  carefully.  This  subject  stated  a certainty 
equivalent  of  3 for  a beef  gamble  (10  0) . Since  she  was 

risk  seeking  in  the  gasoline  gamble  and  also  in  gambles  for 
cash,  this  seemed  peculiar.  As  it  turned  out  she  had  such  a 
strongly  decreasing  marginal  utility  for  beef,  that  any 
increase  beyond  five  pounds  was  worth  nothing  to  her. 

The  (10  0)  beef  gamble  was  worth  not  much  more  than  a 

(5  0)  beef  gamble  to  her,  and  her  low  certainty  equivalent 

said  little  about  her  real  risk  attitude  (in  the  sense  of 
liking  or  disliking  to  take  chances),  but  rather  about  her 
riskless  evaluations  of  beef.  As  in  the  case  of  the  additi- 
vity considerations  where  the  risky  additivity  is  compounded 
with  riskless  additivity,  the  risky  utility  function  derived 
from  lottery  preferences  is  compounded  with  the  riskless 
utility  function.  This  compounding  effect  may  explain  that 
so  many  subjects  had  different  risk  attitudes  in  the  two 


commodities . 
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If  one  accepts  the  conclusion  that  there  is  a com- 
pounding relationship  between  riskless  and  risky  additivity 
and  utility,  how  would  one  interpret  Tversky's  experimental 
results?  First  of  all,  there  is  no  formal  or  behavioral 
reason  for  assuming  an  identity  of  riskless  or  risky  utility 
functions — neither  in  single  attribute  shape,  nor  in  aggre- 
gation form. 

Secondly,  the  validity  of  SEU  is  quite  compatible  with 
differences  in  riskless  and  risky  utility  functions.  There 
are  some  restrictions  to  these  statements  however.  If  the 
riskless  and  the  risky  aggregation  rule  are  additive,  then 
the  formal  properties  of  Model  3 imply  identical  shapes  of 
riskless  and  risky  single  attribute  utility  functions.  If 
they  are  different,  as  the  utility  functions  in  Tversky's 
experiment,  the  SEU  model  will  have  to  be  rejected. 

But  if,  for  example,  the  multiplicative  aggregation  rule  of 
Model  4 holds,  then  the  SEU  assumption  implies  different 
shapes  of  risky  and  riskless  single  attribute  utility  func- 
tions. If  they  are  identical , the  SEU  model  would  have  to 
be  rejected.  The  experimental  results  presented  here  indi- 
cate that  both  aggregation  form  and  shape  of  riskless  and 
risky  utility  functions  are  different.  In  addition,  they 
show  that  Model  3 is  strongly  violated.  This  still  leaves 
room  for  Model  4 to  be  valid  which,  in  fact,  is  not  so 
strongly  violated  by  the  experimental  data. 
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To  summarize  the  discussion  of  the  behavioral  inter- 
pretation of  the  experimental  results: 

1)  Both  row  and  column  independence  are  violated 
through  a bias  in  preferences  that  contradicts 
previous  matching  judgments.  This  bias  may  be 
due  either  to  mismatching  or  a change  in  pref- 
erences and/or  response  strategies.  This  bias 
questions  the  validity  of  Models  1 and  2. 

2)  For  most  subjects  row  independence  is  systematic- 
ally violated  by  multivariate  risk  aversion. 

This  evidence  clearly  contradicts  the  descriptive 
validity  of  Model  1 for  those  subjects.  The 
specific  forms  of  violation  of  marginality  exclude 
Model  3. 

3)  Riskless  and  risky  utility  functions  show  indepen- 
dent and  different  characteristics  both  in  form 

of  aggregation  and  in  the  shape  of  the  utility 
functions  in  single  commodities.  These  differ- 
ences are  compatible  only  with  Models  2 and  4 . 

General  implications  of  the  experimental  results  for 
normative  modelling  of  risky  multiattribute  preferences 
are  limited  by  the  specific  stimuli  used  in  the  experiment. 
Similar  phenomena  as  found  in  the  experimental  analysis 
of  preferences  among  even  chance  gambles  for  gasoline  and 
ground  beef  are  likely  to  occur  only  in  similar  stimulus 
situations,  that  is  when  attributes  are  separable  and  can 
be  traded  off  in  a simple  way  with  money.  But  such  cases 


V* 


► +- 


y~‘- 


Vi 


" 1 


"TZT’ 


86 


are  not  uncommon  in  real  world  decision  making,  and  a 
decision  analyst  can  learn  some  important  lessons  from 
the  experimental  results  for  future  applications  of  risky 
multiattribute  utility  models  in  solving  such  decision 
problems.  The  first  message  is  an  encouraging  repetition 
of  the  results  of  many  sensitivity  analyses:  even  in  the 
presence  of  independence  violations,  simple  models  such  as 
additive  EV  models  predict  preferences  rather  well.  The 
experiment  showed  that  these  models  were  normatively  wrong 
in  the  sense  that  subjects  were  single  attribute  risk 
averse  or  risk  seeking  and  that  they  were  multivariate 
risk  averse.  Although  their  expressed  preferences  among 
gambles  cannot  be  used  as  a normative  criterion  for  the 
validity  of  the  model,  one  would  expect  a normative  model 
to  covariate  with  preference  judgments  in  relatively  simple 
tasks.  Such  covariance  was  found  for  most  subjects. 

Second,  the  overall  level  of  the  observed  violations 
of  model  assumptions  were  not  very  high.  But  the  violations 
of  models — and  thus  the  possible  consequences  of  mismodel- 
ling — were  larger  for  some  models  and/or  some  stimuli. 

These  differences  have  to  be  carefully  considered  in 
building  and  applying  multiattribute  preference  models. 

In  the  discussion  of  the  normative  implications  of 
the  experimental  results,  it  is  important  to  distinguish 
between  violations  of  models  that  represent  biasses  and 
violations  that  are  purposeful  and  consistent.  The  first 
type  of  violations  imposes  problems  of  enforcing 
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Several  important  characteristics  of  risky  multiattribute 
preferences  could  be  demonstrated,  and  their  implications 
for  descriptive  and  normative  modelling  may  be  useful. 

But  many  questions  remain  open,  and  some  have  been  raised 
by  the  experiment: 

1)  What  are  the  precise  behavioral  relationships 
between  risky  and  riskless  utility  functions, 
their  single  attribute  shapes,  and  their 
aggregation  forms? 

2)  What  are  the  reasons  behind  the  exhibited 
biasses:  mismatching,  response  strategy  changes, 
or  real  preference  changes? 

3)  What  are  the  behavioral  correlates  of  multivariate 


risk  aversion? 


APPENDIX 


This  appendix  lists  the  left  and  right  cash  equivalent 
responses  (CL  and  CR)  for  each  of  the  45  check  items, 


both  sessions 


according  to  the  following  order  of  check  items 


Check  item 
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left  right 
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beef.  Subjects  matched  gambles  or  commodity  bundles  against  a standard  and 
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lus contexts  as  required  by  the  independence  assumptions.  Effects  of 
response  modes,  instructions,  and  personal  preference  characteristics  were 
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All  independence  assumptions  and  models  were  violated  by  a bias  to  prefer 
a gamble  or  commodity  bundle  that  was  previously  matched  against  a standard, 
independently  of  context.  Systematic  and  strong  violations  of  the  marginal  - 
ity  assumption  were  found  in  form  of  a multivariate  risk  aver ion:  subjects 
tended  to  prefer  a gamble  with  more  halanced  multiple  outcomes  over  a gamble 
with  extreme  multiple  outcomes,  even  if  all  single  outcomes  had  an  equal 
chance  of  occurring.  Both  the  bias  and  multivariate  risk  aversion  were 
independent  of  response  modes  and  instructions.  Other  preference  character- 
istics such  as  single  attribute  risk  attitude  and  preferential  interaction 
of  commodities  seemed  unrelated  to  multivariate  risk  aversion. 

The  bias  to  prefer  a previously  matched  gamble  over  a standard  cannot  be 
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ences. This  bias  could  be  due  either  to  mismatching  or  to  a change  in 
preferences  after  matching.  The  phenomenon  of  multivariate  risk  aversion 
proved  to  be  a stable  property  of  risky  multiattribute  preferences  for  the 
stimuli  considered.  Descriptive  models  for  risky  multiattribute  preferences 
will  have  to  take  this  phenomenon  into  account  in  similar  stiumulus  situa- 
tions. For  normative  modelling  the  results  of  the  experiment  indicate  the 
necessity  to  carefully  check  the  consistency  of  preferences  assessed  by 
procedures  that  are  based  on  indifference  judgments  and  to  compare  them  with 
actual  choices.  The  multivariate  risk  aversion  effect  suggests  that  simple 
additive  expected  utility  models  may,  in  some  cases,  be  inappropriate  for 
prescribing  preferences.  Checks  of  the  marginal ity  assumption  and  analyses 
of  the  form  of  multivariate  risk  aversion  should  be  designed  and  tested 
carefully,  before  modelling  decision  makers'  preferences  with  additive 
expected  utility  models. 
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